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Preface 


There have been two revolutions in the way we view the physical world in the 
twentieth century: relativity and quantum mechanics. In quantum mechanics the 
revolution has been both profound—requiring a dramatic revision in the structure of 
the laws of mechanics that govern the behavior of all particles, be they electrons 
or photons—and far-reaching in its impact-—determining the stability of matter 
itself, shaping the interactions of particles on the atomic, nuclear, and particle 
physics level, and leading to macroscopic quantum effects ranging from lasers and 
superconductivity to neutron stars and radiation from black holes. Moreover, in a 
triumph for twentieth-century physics, special relativity and quantum mechanics 
have been joined together in the form of quantum field theory. Field theories such as 
quantum electrodynamics have been tested with an extremely high precision, with 
agreement between theory and experiment verified to better than nine significant 
figures. It should be emphasized that while our understanding of the laws of physics 
is continually evolving, always being subjected to experimental scrutiny, so far no 
confirmed discrepancy between theory and experiment for quantum mechanics has 
been detected. 

This book is intended for an upper-division course in quantum mechanics. The 
most likely audience for the book consists--nf students who have completed a course in 
modem physics that includes an introduction to quantum mechanics that emphasizes 
wave mechanics. Rather than continue with a similar approach in a second course, I 
have chosen to introduce the fundamentals of quantum mechanics through a detailed 
discussion of the physics of intrinsic spin. Such an approach has a number of 
significant advantages. First, students find starting a course with something “new” 
such as intrinsic spin both interesting and exciting, and they enjoy making the 
connections with what they have seen before. Second, spin systems provide us with 
many beautiful but straightforward illustrations of the essential structure of quantum 
mechanics, a structure that is not obscured by the mathematics of wave mechanics. 
Quantum mechanics can be presented through concrete examples. I believe that most 
physicists leam through specific examples and then find it easy to generalize. By 
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starting with spin, students are given plenty of time to assimilate this novel and 
striking material. I have found that they seem to learn this key introductory material 
easily and well—material that was often perceived to be difficult when I came to it 
midway through a course that began with wave mechanics. Third, when we do come 
to wave mechanics, students see that wave mechanics is only one aspect of quantum 
mechanics, not the fundamental core of the subject. They see at an early stage that 
wave mechanics and matrix mechanics are just different ways of calculating based 
on the same underlying quantum mechanics and that the approach they use depends 
on the particular problem they are addressing. 

I have been inspired by two sources, an “introductory” treatment in Volume III 
of The Feynman Lectures on Physics and an advanced exposition in J. J. Sakurai’s 
Modem Quantum Mechanics. Overall, I believe that wave mechanics is probably 
the best way to introduce students to quantum mechanics. Wave mechanics makes 
the largest overlap with what students know from classical mechanics and shows 
them the strange behavior of quantum mechanics in a familiar environment. This 
is probably why students find their first introduction to quantum mechanics so 
stimulating. However, starting a second course with wave mechanics runs the risk 
of diminishing much of the excitement and enthusiasm for the entirely new way of 
viewing nature that is demanded by quantum mechanics. It becomes sort of old hat, 
material the students has seen before, repeated in more depth. It is, I believe, with the 
second exposure to quantum mechanics that something like Feynman’s approach has 
its best chance to be effective. But to be effective, a quantum mechanics text needs 
to make lots of contact with the way most physicists think and calculate in quantum 
mechanics using the language of kets and operators. This is Sakurai’s approach in 
his graduate-level textbook. In a sense, the approach that I am presenting here can 
be viewed as a superposition of these two approaches, but at the junior-senior level. 

Chapter 1 introduces the concepts of the quantum state vector, complex proba¬ 
bility amplitudes, and the probabilistic interpretation of quantum mechanics in the 
context of analyzing a number of Stem-Gerlach experiments carried out with spin¬ 
or particles. By introducing ket vectors at the beginning, we have the framework for 
thinking about states as having an existence quite apart from the way we happen to 
choose to represent them, whether it be with matrix mechanics, which is discussed 
at length in Chapter 2, or, where appropriate, with wave mechanics, which is in¬ 
troduced in Chapter 6. Moreover, there is a natural role for operators; in Chapter 2 
they rotate spin states so that the spin “points” in a different direction. I do not fol¬ 
low a postulatory approach, but rather I allow the basic physics of this spin system 
to drive the introduction of concepts such as Hermitian operators, eigenvalues, and 
eigenstates. 

In Chapter 3 the commutation relations of the generators of rotations are deter¬ 
mined from the behavior of ordinary vectors under rotations. Most of the material 
in this chapter is fairly conventional; what is not so conventional is the introduc- 
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tion of operator techniques for determining the angular momentum eigenstates and 
eigenvalue spectrum and the derivation of the uncertainty relations from the com¬ 
mutation relations at such an early stage. Since so much of our initial discussion 
of quantum mechanics revolves around intrinsic spin, it is important for students to 
see how quantum mechanics can be used to determine from first principles the spin 
states that have been introduced in Chapters 1 and 2, without having to appeal only 
to experimental results. 

Chapter 4 is devoted to time evolution of states. The natural operation in time 
development is to translate states forward in time. The Hamiltonian enters as the 
generator of time translations, and the states are shown to obey the Schrodinger 
equation. Most of the chapter is devoted to physical examples. In Chapter 5 another 
physical system, the spin-spin interaction of an electron and proton in the ground 
state of hydrogen, is used to introduce the spin states of two spin-! particles. The 
total-spin-0 state serves as the basis for a discussion of the Einstein-Podolsky-Rosen 
(EPR) paradox and the Bell inequalities. 

The main theme of Chapter 6 is making contact with the usual formalism of wave 
mechanics. The special problems in dealing with states such as position and momen¬ 
tum states that have a continuous eigenvalue spectrum are analyzed. The momentum 
operator enters naturally as the generator of translations. Sections 6.8 through 6.10 
include a general discussion with examples of solutions to the Schrodinger equation 
that can serve as a review for students with a good background in one-dimensional 
wave mechanics. 

Chapter 7 is devoted to the one-dimensional simple harmonic oscillator, which 
merits a chapter all its own. Although the material in Chapter 8 on path integrals 
can be skipped without affecting subsequent chapters (with the exception of Sec¬ 
tion 14.1, on the Aharonov-Bohm effect), I believe that path integrals should be 
discussed, if possible, since this formalism provides real insight into quantum dy¬ 
namics. However, I have found it difficult to fit this material into our one-semester 
course, which is taken by all physics majors as well as some students majoring in 
other disciplines. Rather, I have choserrto postpone path integrals to a second course 
and then to insert the material in Chapter 8 before Chapter 14. Incidentally, the ma¬ 
terial on path integrals is the only part of the book that may require students to have 
had an upper-division classical mechanics course, one in which the principle of least 
action is discussed. 

Chapters 9 through 13 cover fully three-dimensional problems, including the 
two-body problem, orbital angular momentum, central potentials, time-independent 
perturbations, identical particles, and scattering. An effort has been made to include 
as many physical examples as possible. 

Although this is a textbook on nonrelativistic quantum mechanics, I have chosen 
to include a discussion of the quantized radiation held in the final chapter, Chapter 14. 
The use of ket and bra vectors from the beginning and the discussion of solutions 
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to problems such as angular momentum and the harmonic oscillator in terms of 
abstract raising and lowering operators should have helped to prepare the student 
for the exciting jump to a quantized electromagnetic field. By quantizing this field, 
we can really understand the properties of photons, we can calculate the lifetimes for 
spontaneous emission from first principles, and we can understand why a laser works. 
By looking at higher order processes such as photon-atom scattering, we can also see 
the essentials of Feynman diagrams. Although the atom is treated nonrelativistically, 
it is still possible to gain a sense of what quantum field theory is all about at this level 
without having to face the complications of the relativistic Dirac equation. For the 
instructor who wishes to cover time-dependent perturbation theory but does not have 
time for all of the chapter, Section 14.5 stands on its own. 

Although SI units are the standard for undergraduate education in electricity 
and magnetism, I have chosen in the text to use Gaussian units, which are more 
commonly used to describe microscopic phenomena. Fiowever. with the possible 
exception of the last chapter, with its quantum treatment of the electromagnetic field, 
the choice of units has little impact. My own experience suggests that students who 
are generally at home with SI units are comfortable (as indicated in a number of 
footnotes through the text) replacing e 2 with e 2 or ignoring the factor of c 
in the Bohr magneton whenever they need to carry out numerical calculations. In 
addition, electromagnetic units are discussed in Appendix A. 

In writing the second edition, I have added two sections to Chapter 5, one on 
entanglement and quantum teleportation and the other on the density operator. Given 
the importance of entanglement in quantum mechanics, it may seem strange, as it 
does to me now, to have written a quantum mechanics textbook without explicit use 
of the word entanglement. The concept of entanglement is, of course, at the heart 
of the discussion of the EPR paradox, which focused on the entangled state of two 
spin-1 particles in a spin-singlet state. Nonetheless, it wasn’t until the early 1990s, 
when topics such as quantum teleportation came to the fore, that the importance of 
entanglement as a fundamental resource that can be utilized in novel ways was fully 
appreciated and the term entanglement began to be widely used. I am also somewhat 
embarrassed not to have included a discussion of the density operator in the first 
edition. Unlike a textbook author, the experimentalist does not necessarily have the 
luxury of being able to focus on pure states. Thus there is good reason to introduce 
the density operator (and the density matrix) as a systematic way to deal with mixed 
states as well as pure states in quantum mechanics. I have added a section on coherent 
states of the harmonic oscillator to Chapter 7. Coherent states were first derived by 
Schrodinger in his efforts to find states that satisfy the correspondence principle. 
The real utility of these states is most apparent in Chapter 14, where it is seen that 
coherent states come closest to representing classical electromagnetic waves with a 
well-defined phase. I have also added a section to Chapter 14 on cavity quantum 
electrodynamics, showing how the interaction of the quantized electromagnetic 
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field with atoms is modified by confinement in a reflective cavity. Like quantum 
teleportation, cavity quantum electrodynamics is a topic that really came to the fore 
in the 1990s. In addition to these new sections, I have added numerous worked 
example problems to the text, with the hope that these examples will help students 
in mastering quantum mechanics. I have also increased the end-of-chapter problems 
by 25 percent. 

There is almost certainly enough material here for a full-year course. For a one- 
semester course, I have covered the material through Chapter 12, omitting Sections 
6.7 through 6.10 and, as noted earlier, Chapter 8. The material in the latter half of 
Chapter 6 is covered thoroughly in our introductory course on quantum physics. See 
John S. Townsend, Quantum Physics: A Fundamental Approach to Modern Physics, 
University Science Books, 2010. In addition to Chapter 8, other sections that might 
be omitted in a one-semester course include parts of Chapter 5, Section 9.7, and 
Sections 11.5 through 11.9. Or one might choose to go as far as Chapter 10 and 
reserve the remaining material for a later course. 

A comprehensive solutions manual for the instructor is available from the pub¬ 
lisher, upon request of the instructor. 

Finally, some grateful acknowledgments are certainly in order. Students in my 
quantum mechanics classes have given me useful feedback as I have taught from the 
book over the years. Colleagues at Harvey Mudd College who have offered valuable 
comments as well as encouragement include Bob Cave, Chih-Yung Chen, Tom Don¬ 
nelly, Tom Helliwell, Theresa Lynn, and Peter Saeta. Art Weldon of West Virginia 
University suggested a number of ways to improve the accuracy and effectiveness 
of the first edition. This text was initially published in the McGraw-Hill Interna¬ 
tional Series in Pure and Applied Physics. I have benefited from comments from the 
following reviewers: William Dalton, St. Cloud State University; Michael Grady, 
SUNY-Fredonia; Richard Hazeltine, University of Texas at Austin; Jack Mochel, 
University of Illinois at Urbana-Champaign; and Jae Y. Park, North Carolina State 
University. For the first edition, the Pew Science Program provided support for Doug 
Dunston and Doug Ridgway, two Harvey Mudd College students, who helped in the 
preparation of the text and figures, respectively, and Helen White helped in checking 
the galley proofs. A number of people have kindly given me feedback on the material 
for the second edition, including Rich Holman, Carnegie Mellon University; Randy 
Hulet, Rice University; Jim Napolitano, RPI; Tom Moore and David Tanenbaum, 
Pomona College; and John Taylor, University of Colorado. 

I have been fortunate to have the production of the book carried out by a very 
capable group of individuals headed by Paul Anagnostopoulos, the project manager. 
In addition to Paul, 1 want to thank Lee Young for copy editing, Joe Snowden for 
entering the copyedits and laying out the pages, Tom Webster for the artwork, 
MaryEllen Oliver for her amazingly thorough job of proofreading, Yvonne Tsang 
for text design, and Genette Itoko McGrew for her creative cover design. I also wish 


Page 13 (metric system) 



xvi | Preface 


to thank Jane Ellis and Brace Armbraster of University Science Books not only 
for their assistance but also for the care and attention to detail they have taken in 
preparing this new edition of the book. And I especially want to thank ray wife, 
Ellen, for cheerfully letting me devote so much time to this project. 

Please do not hesitate to contact me if you find errors or have suggestions that 
might improve the book. 

John S. Townsend 
Department of Physics 
Harvey Mudd College 
Claremont, CA 91711 
townsend @ hmc.edu 
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CHAPTER 1 


Stern-Gerlach Experiments 


We begin our discussion of quantum mechanics with a conceptually simple experi¬ 
ment in which we measure a component of the intrinsic spin angular momentum of 
an atom. This experiment was first carried out by O. Stem and W. Gerlach in 1922 
using a beam of silver atoms. We will refer to the measuring apparatus as a Stern- 
Gerlach device. The results of experiments with a number of such devices are easy 
to describe but, as we shall see, nonetheless startling in their consequences. 


1.1 The Original Stern-Gerlach Experiment 


Before analyzing the experiment, we need to know something about the relationship 
between the intrinsic spin angular momentum of a particle and its corresponding 
magnetic moment. To the classical physicist, angular momentum is always orbital 
angular momentum, namely, L = r x p. Although the Earth is said to have spin 
angular momentum ho due to its rotation about its axis as well as orbital angular 
momentum due to its revolution about the Sun, both types of angular momentum are 
just different forms of L. The intrinsic spin angular momentum S of a microscopic 
particle is not at all of the same sort as orbital angular momentum, but it is real 
angular momentum nonetheless. 

To get a feeling for the relationship that exists between the angular momentum of 
a charged particle and its corresponding magnetic moment, we first use a classical 
example and then point out some of its limitations. Consider a point particle with 
charge q and mass m moving in a circular orbit of radius r with speed v. The magnetic 
moment ,/jl is given by 


ft = 


LA 

c 


Tj c 


qvr = q L 

2c 2 me 


( 1 . 1 ) 

1 
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where ,4 is the area of the circle formed by the orbit, the current / is the charge q 
divided by the period T = ( 2nr/v ), and L—mvr is the orbital angular momentum 
of the particle. 1 Since the magnetic moment and the orbital angular momentum are 
parallel or antiparallel depending on the sign of the charge q , we may express this 
relationship in the vector form 



This relationship between L and p, turns out to be generally true whenever the mass 
and charge coincide in space. One can obtain different constants of proportionality 
by adjusting the charge and mass distributions independently. For example, a solid 
spherical ball of mass m rotating about an axis through its center with the charge q 
distributed uniformly only on the surface of the ball has a constant of proportionality 
of 5q/6mc. 

When we come to intrinsic spin angular momentum of a particle, we write 

H = — S (1.3) 

2 me 

where the value of the constant g is experimentally determined to be g = 2.00 for 
an electron, g — 5.58 for a proton, or even g = —3.82 for a neutron. 2 3 One might be 
tempted to presume that g is telling us about how the charge and mass are distributed 
for the different particles and that intrinsic spin angular momentum is just orbital 
angular momentum of the particle itself as it spins about its axis. We will see as we 
go along that such a simple classical picture of intrinsic spin is entirely untenable 
and that the intrinsic spin angular momentum we are discussing is a very different 
beast indeed. In fact, it appears that even a point particle in quantum mechanics may 
have intrinsic spin angular momentum. 1 Although there are no classical arguments 
that we can give to justify (1.3), we can note that such a relationship between the 


1 If you haven’t seen them before, the Gaussian units we are using for electromagnetism may 
take a little getting used to. A comparison of SI and Gaussian units is given in Appendix A. In 
SI units the magnetic moment is just I A, so you can ignore the factor of c, the speed of light, in 
expressions such as (1.1) if you wish to convert to SI units. 

2 Each of these g factors has its own experimental uncertainty. Recent measurements by B. 
Odom, D. Hanneke, B. D’Urso, and G. Gabrielse. Phys. Rev. Lett. 97, 030801 (2006), have shown 
that g/2 for an electron is 1.00115965218085(76), where the factor of 76 reflects the uncertainty 
in the last two places. Relativistic quantum mechanics predicts that g = 2 for an electron. The 
deviations from this value can be accounted for by quantum field theory. The much larger deviations 
from g — 2 for the proton and the (neutral) neutron are due to the fact that these particles are not 
fundamental but are composed of charged constituents called quarks. 

3 It is amusing to note that in 1925 S. Goudsmit and G. Uhlenbeck as graduate students 
“discovered" the electron’s spin from an analysis of atomic spectra. They were trying to understand 
why the optical spectra of alkali atoms such as sodium are composed of a pair of closely spaced 
lines, such as the sodium doublet. Goudsmit and Uhlenbeck realized that an additional degree of 
freedom (an independent coordinate) was required, a degree of freedom that they could understand 
only if they assumed the electron was a small ball of charge that could rotate about an axis. 
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(a) 



(b) 


Figure 1.1 (a) A schematic diagram of the Stern-Gerlach experiment, (b) A cross-sectional 
view of the pole pieces of the magnet depicting the inhomogeneous magnetic field they 
produce. 


magnetic moment and the intrinsic spin angular momentum is at least consistent with 
dimensional analysis. At this stage, you can think of g as a dimensionless factor that 
has been inserted to make the magnitudes as well as the units come out right. 

Let’s turn to the Stern-Gerlach experiment itself. Figure 1.1a shows a schematic 
diagram of the apparatus. A collimated beam of silver atoms is produced by evap¬ 
orating silver in a hot oven and selecting those atoms that pass through a series of 
narrow slits. The beam is then directed between the poles of a magnet. One of the 
pole pieces is flat; the other has a sharp tip. Such a magnet produces an inhomoge¬ 
neous magnetic field, as shown in Fig. 1.1b. When a neutral atom with a magnetic 
moment fi enters the magnetic field B, it experiences a force F ~V(ji ■ B), since 
-fi- B is the energy of interaction of a magnetic dipole with an external magnetic 
field. If we call the direction in which the inhomogeneous magnetic field gradient is 
large the z direction, we see that 


F, 


3B 

H — 

3z 


l l z 


3 z 


(1.4) 


In this way they could account for the electron’s spin angular momentum and magnetic dipole 
moment. The splitting of the energy levels that was needed to account lor the doublet could then 
be understood as due to the potential energy of interaction of the electron’s magnetic moment in 
the interned magnetic field of the atom (see Section 11.5). Goudsmit and Uhlenbeck wrote up their 
results for their advisor P. Ehrenfest, who then advised them to discuss the matter with H. Lorentz. 
When Lorentz showed them that a classical model of the electron required that the electron must 
be spinning at a speed on the surface approximately ten times the speed of light, they went to 
Ehrenfest to tell him of their foolishness. He informed them that he had already submitted their 
paper for publication and that they shouldn’t worry since they were “both young enough to be able 
to afford a stupidity.” Physics Today, June 1976, pp. 40-48. 
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Notice that we have taken the magnetic field gradient 3 B z /dz in the figure to be neg¬ 
ative, so that if ix, is negative as well, then F z is positive and the atoms are deflected 
in the positive z direction. Classically, /x z = \/i\ cos (9, where (9 is the angle that the 
magnetic moment fi makes with the z axis. Thus /x z should take on a continuum of 
values ranging from +/x to -p. Since the atoms coming from the oven are not polar¬ 
ized with their magnetic moments pointing in a preferred direction, we should find a 
corresponding continuum of deflections. In the original Stern-Gerlach experiment, 
the silver atoms were detected by allowing them to build up to a visible deposit on a 
glass plate. Figure 1.2 shows the results of this original experiment. The surprising 
result is that \i z takes on only two values, corresponding to the values ±H/2 for 
Numerically, h = h/2rc = 1.055 x 1CT 27 erg • s = 6.582 x 10~ 16 eV • s, where h 
is Planck’s constant. 
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Figure 1.2 A postcard from Walther Gerlach to Niels Bohr, dated February 8, 1922. 
Note that the images on the postcard have been rotated by 90° relative to Fig. 1.1, where 
the collimating slit is horizontal. The left-hand image of the beam profile without the 
magnetic field shows the effect of the finite width of this collimating slit. The right-hand 
image shows the beam profile with the magnetic field. Only in the center of the apparatus 
is the magnitude of the magnetic field gradient sufficiently strong to cause splitting. The 
pattern is smeared because of the range of speeds of the atoms coming from the oven. 
Translation of the message: “My esteemed Herr Bohr, attached is the continuation of 
our work [vide Zeitschr. f. Phys. 8, 110 (1921)]: the experimental proof of directional 
quantization. We congratulate you on the confirmation of your theory! With respectful 
greetings. Your most humble Walther Gerlach.” Photograph reproduced with permission 
from the Niels Bohr Archive. 
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Silver atoms are’bomposed of 47 electrons and a nucleus. Atomic theory tells 
us the total orbital and total spin angular momentum of 46 of the electrons is equal 
to zero, and the 47th electron has zero orbital angular momentum. Moreover, as 
(1.3) shows, the nucleus makes a very small contribution to the magnetic moment 
of the atom because the mass of the nucleus is so much larger than the mass of the 
electron. Therefore, the magnetic moment of the silver atom is effectively due to 
the magnetic moment of a single electron. Thus, in carrying out their experiment. 
Stem and Gerlach measured the component of the intrinsic spin angular momentum 
of an electron along the z axis and found it to take on only two discrete values, 
+h /2 and —h/ 2, commonly called “spin up” and “spin down," respectively. Later, 
we will see that these values are characteristic of a spin-^ particle. Incidentally, we 
chose to make the bottom N pole piece of the Stern-Gerlach (SG) device the one 
with the sharp tip for a simple reason. With this configuration, B z decreases as z 
increases, making 3ZL/3z negative. As we noted earlier, atoms with a negative /z, 
are deflected upward in this held. Now an electron has charge q = — e and from (1.3) 
with g = 2, Hz = ( —e/m e c)S .. Thus a silver atom with S z — h/ 2, a spin-up atom, 
will conveniently be deflected upward. 

1.2 Four Experiments 


Now that we have seen how the actual Stern-Gerlach experiment was done, let’s turn 
our attention to four simple experiments that will tell us much about the structure 
of quantum mechanics. If you like, you can think of these experiments as thought 
experiments so that we needn’t focus on any technical difficulties that might be faced 
in carrying them out. 

EXPERIMENT 1 

Let us say a particle that exits an SGz t device, one with its inhomogeneous magnetic 
field parallel to the z axis, with S z = +h /2 is in the state |+z). The symbol |+z), 
known as a ket vector, is a convenient way of denoting this state. Suppose a beam 
of particles, each of which is in this state, enters another SGz device. We find that 
all the particles exit in the state |+z); that is, the measurement of S z yields the value 
+h/2 for each of the particles, as indicated in Fig. 1.3a. 

EXPERIMENT 2 

Consider a beam of particles exiting the SGz device in the state |+z), as in Exper¬ 
iment 1. We next send this beam into an SGx device, one with its inhomogeneous 
magnetic field oriented along the x axis. We find that 50 percent of the particles exit 
the second device with S x = h/2 and are therefore in the state |+x), while the other 
50 percent exit with S x = — h/2 and are therefore in the state |— x) (see Fig. 1.3b). 
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magnet, figure I.4D maicates me pains mat spin-up ana spm-uuwu puwiw «vum 
follow in this modified SG device. 

You mieht think such a device serves no purpose, but we can use a modified 
SG device to make a measurement and select a particular spin state, For example. 


4 R. p. Feynman, R. B, Leighton, and M Sands, The Feynman Lectures on Physics, Addisoti- 
Wesiey, Reading. MA, 1965, vol. 3, Chapter 5. 
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Figure 1.5 Selecting a spin-up state with a modified Stern-Gerlach device 
by blocking the spin-down state. 


if the direction of the inhomogeneous magnetic field of the three magnets is along 
the x axis, we can select a particle in the |+x) spin state by blocking the path that 
a particle in the |-x) spin state would take, as indicated in Fig. 1.5. Then all the 
particles exiting the modified three-magnet SGx device would be in the state |+x). 
In fact, we can repeat Experiment 3 with the SGx device replaced by a modified SGx 
device. If the |-x) state is filtered out by inserting a block in the lower path, we find, 
of course, exactly the same results as in Experiment 3; that is, when we measure 
with the last SGz device, we find 50 percent of the particles in the state |+z) and 
50 percent in the state |-z). Similarly, if we filter out the state |+x) by inserting a 
block in the upper path, we also find 50 percent of the particles exiting the last SGz 
device in the state |+z) and 50 percent in the state |—z). 

EXPERIMENT 4 

We are now ready for Experiment 4. As in Experiment 3, a beam of particles in the 
state |+z) from an initial SGz device enters an SGx device, but in this experiment it 
is a modified SGx device in which we do not block one of the paths and, therefore, 
do not make a measurement of S x . We then send the beam from this modified SGx 
device into another SGz device. As indicated in Fig. 1.6, we find that 100 percent 
of the particles exit the last SGz device in the state |+z), just as if the modified SGx 
device were absent from the experiment and we were repeating Experiment 1. 

Before carrying out Experiment 4, it might seem obvious that 50 percent of the 
particles passing through the modified SGx device are in the state j +x) and 50 percent 
are in the state |—x). But the results of Experiment 4 contradict this assumption, 
since, if it were true, we would expect to find 50 percent ot the particles in the state 
|+z) and 50 percent of the particles in the state | —z) when the unfiltered beam exits 
the last SGz device. Our results are completely incompatible with the hypothesis that 
the particles traversing the modified SGx device have either S x — h /2 or S x — —h/2. 



Figure 1.6 A block diagram of Experiment 4. Note that we cannot indicate 
the path followed through the three-magnet modified SGx device since no 
measurement is carried out to select either a |+x) or |—x) spin state. 
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Moreover, even if we carry out the experiment with a beam of such low intensity that 
one particle at a time is passing through the SG devices, we still find that each of the 
particles has S z = h/2 when it leaves the last SGz device. Thus, the issue raised by 
this experiment cannot be resolved by some funny business involving the interactions 
of the particles in the beams as they pass through the modified SGx device. 

So far, we have been able to describe the results of these Stem-Gerlach exper¬ 
iments simply in terms of the percentage of particles exiting the SG devices in a 
particular state because the experiments have been carried out on a beam of parti¬ 
cles, namely, on a large number of particles. For a single particle, it is generally not 
possible to predict with certainty the outcome of the measurement in advance. In Ex¬ 
periment 2, for example, before a measurement of S x on a particle in the state |+z), all 
we can say is that there is a 50 percent probability of obtaining S x = h /2 and a 50 per¬ 
cent probability of obtaining S x = —h/2. However, probabilities alone do not permit 
us to understand Experiment 4. We cannot explain the results of this experiment by 
adding the probabilities that a particle passing through the modified SGx device is in 
the state |+x> or in the state |—x), since this fails to account for the differences when 
comparing the results of Experiment 3, in which 50 percent of the particles in the state 
|+x) (or j —x)) yield S z = —h/2, with the results of Experiment 4, in which none of 
the particles has S z — —h/2 when exiting the last SGz device. Somehow in Experi¬ 
ment 4 we must eliminate the probability that the particle is in the state | —z) when it 
enters the last SGz device. What we need is some sort of “interference” that can can¬ 
cel out the |—z) state. Such interference is common in the physics of waves, where 
two waves can interfere destructively to produce minima as well as constructively to 
produce maxima. With electromagnetic waves, for example, it isn’t the intensities 
that interfere but rather the electromagnetic fields themselves. For electromagnetic 
waves the intensity is proportional to the square of the amplitude of the wave. With 
this in mind, for our Stern-Gerlach experiments we introduce a probability ampli¬ 
tude that we will “square” to get the probability. If we don't observe which path 
is taken in the modified SGx device by inserting a block, or filter, we must add the 
amplitudes to take the two different paths corresponding to the |+x) and |— x) states. 
Even a single particle can have an amplitude to be in both states, to take both paths; 
when we add, or superpose, the amplitudes, we obtain an amplitude for the particle 
to be in the state |+z) only. 5 In summary, when we don’t make a measurement in 
the modified SG device, we must add the amplitudes, not the probabilities. 


3 In Section 2.3 we will discuss in more detail how this interference in Experiment 4 works. 
These results are reminiscent of the famous double-slit experiment, in which it seems logical 
to suppose that the particles go through one slit or the other, but the interference pattern on a 
distant screen is completely incompatible with this simple hypothesis. The double-slit experiment 
is discussed briefly in Section 6.7. If you are unfamiliar with this experiment from the perspective 
of quantum mechanics, an excellent discussion is given in The Feynman Lectures on Physics , 
vol. 3, Chapter 1. 
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1.3 The Quantum State Vector 


In our description of the state of a particle in quantum mechanics, we have been 
using a new notation in which states, such as |+z), are denoted by abstract vectors 
called ket vectors. Such a description includes as much information about the state 
of the particle as we are permitted in quantum mechanics. For example, the ket |+x) 
is just a shorthand way of saying that the spin state of the particle is such that if we 
were to make a measurement of S x , the intrinsic spin angular momentum in the x 
direction, we would obtain the value /i/2. There are clearly other attributes that are 
required to give a complete description of the particle, such as the particle’s position 
or momentum. However, for the time being we are concentrating on the spin degrees 
of freedom of the particle. 6 Later, in Chapter 6, we will see how to introduce other 
degrees of freedom in the description of the state of the particle. 

Classical physics uses a different type of vector in its description of nature. Some 
of these ordinary vectors are more abstract than others. For example, consider the 
electric field E, which is a useful but somewhat abstract vector. If there is an electric 
field present, we know that a test charge q placed in the field will experience a force 
F = qE. Of course, even the force F will not be observed directly. We would probably 
allow the particle to be accelerated by the force, measure the acceleration, and then 
use Newton’s law F = ma to determine F and thence E. 

Let’s suppose the electric field in the location where you are reading this book has 
a constant value, which you could determine in the way we have just outlined. How 
do you tell your friends about the value, both magnitude and direction, of E? You 
might just point in the direction of E to show its direction. But what if your friends are 
not present and you want to write down E on a piece of paper? You would probably 
set up a coordinate system and choose basis vectors i, j, and k whose direction 
you could easily communicate. Using this coordinate system, you would denote the 
electric field as E = E,\ + E y j + £ z k. In fact, we often use a shorthand notation 
in which we suppress the unit vectors and just say E = (E x , E y , E z ), although in 
the notation we will be using in our discussion of quantum mechanics, it would be 
better to denote this as E -x ( E x , E y , E z ). How do we obtain the value for E x , for 
example? We just project the electric field onto the x axis. Formally, we take the dot 
product to find E x — i • E = |E| cos 9, where 9 is the angle the electric field E makes 
with the x axis, as shown in Fig. 1,7. 

Let’s return to our discussion of quantum state vectors. If we send a spin- \ particle 
into an SGz device, we obtain only the values h/2 and —H/2, corresponding to the 


6 The historical development of quantum mechanics initially focused on the more obvious 
degrees of freedom, such as a particle’s position. In fact, Goudsmit was fond of relating how, 
when confronted with the need to introduce a new degree of freedom for the intrinsic spin of 
the electron in order to explain atomic spectra, he had to ask Uhlenbeck what was meant by the 
expression “degree of freedom " 
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y * 



Figure 1.7 The x and y components of an electric held 
E making an angle 9 with the x axis can be obtained by 
taking the dot product of E with the unit vectors i and j. 
For a classical vector such as E, E x and E. can also be 
obtained by projecting E onto the x and y axes. 


particle ending up in the state |+z) or ending down in the state |— z), respectively. 
These two states can be considered as vectors that form a basis for our abstract 
quantum mechanical vector space. If the particle is initially in the state |+z), we 
have seen in Experiment 1 that there is zero amplitude for the particle to be found in 
the state | —z), which we denote by {—z|+z) = 0. We can think of this as telling us 
that the vectors are orthogonal, the analogue of i • j — 0 in our electric field example. 
Of course if we send a particle in the state 14-z) into an SGz device, we always find the 
particle in the state |+z). In the language of quantum mechanical amplitudes this is 
clearly telling us that the amplitude {Fz|+z) is nonzero. As we will see momentarily, 
it is convenient to require that our quantum mechanical vectors be unit vectors and 
therefore satisfy (+z|+z) = 1, just as i • i — 1. We similarly require that {—z|—z) = 1 
as well, just as j ■ j = 1. 

Suppose the particle is in the state |+x). From Experiment 3 we know that the 
particle has nonzero amplitudes, which we can call c + and c_, to be in the states 
|+z) and |—z), respectively. We can express this state as |+x) = c + |+z) + c_|— z), 
a linear combination of the states |+z) and [— z). In fact, it is convenient at this 
stage to consider an arbitrary spin state |tjr), which could be created by sending a 
beam of particles with intrinsic spin-4 through an SG device with its inhomogeneous 
magnetic field oriented in some arbitrary direction and selecting those particles that 
are deflected, for example, upward. In general, this state, like |+x), will have nonzero 
amplitudes to yield both ft/2 and —ft/2 if a measurement of S z is made. Thus we 
will express this state 1 1 //) as 


\xfr) =c + |+z) + cj-z) (1.5) 

where the particular values for c, and c depend on the orientation of the SG device. 
That an arbitrary state | //) can be expressed as a superposition of the states |+z) and 
| —z) means that these states form a complete set, just as the unit vectors i, j, and k 
form a complete set for expressing an electric field E in three dimensions. Although 
we are describing the states of spin angular momentum of a spin-! particle in, of 
course, three dimensions, we need only the basis states |+z) and |—z) to span this 
two-dimensional vector space. 
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How can we formally determine the values of c + and c ? In order to take the 
analogue of the dot product in our ordinary classical vector example, we need 
to introduce a new type of vector called a bra vector. 7 For every ket |i j/) there 

corresponds a bra Thus we have two different ways to denote a state with 

5, = /i/2, with the ket |+z) and the bra (+z|. The fate of a bra such as (ip\ is to 
meet up with a ket |t/r) to form an amplitude, or inner product, (ip |t/r) in the form of 
a bracket—hence the name for bras and kets. The amplitude {(p\ t/r) is the probability 
amplitude for a particle in the state |t//} to be found in the state \<p). From our earlier 
experiments we know that (—z|+z) = 0, and similarly (+z|—z) = 0, since a particle 
in the state |—z), with S z = —/i/2, has zero amplitude to be found in the state |+z), 
with S z = h/2. Thus from (1.5), we can deduce that 

(+zji/f) = c + (+z|+z) + c_(+z|— z) = c + (1.6a) 

(—z|V'} = c + (-z|+z) + c_(-z|-z) = c_ (1.6b) 

or simply c ± = (±z|t/r). This enables us to express (1.5) in the form 


\ty) = H-z|£) l+z) -f ( z 1 ) i-z) = |+z)(+z|i p) + | z)( x\\jf ) (1.7) 

C_j_ C_ 


where in the last step w'e have positioned the amplitudes after the kets in a suggestive 
way. Note that the amplitudes ( t zh/V) and (—zh jr), the brackets, are (complex) 
numbers, and thus the product of an amplitude times a ket vector is itself just a 
ket vector. It really doesn’t matter whether we position the amplitude before or after 
the ket. Writing the ket vector j <//) in the form (1.7) is analogous to expressing the 
electric field E in the form E = E x \ + £ y j + F/k = i(i • E) + j(j • E) + k(k • E). 

Since to each ket there corresponds a bra vector, we must be able to express (//1 
in terms of <+zj and (—z] as 


(■^1 =c' + {+t\ -F<C(-z| (1-8) 

Using the same technique as before, we see that 

(i/f|+z) =c^(+z|+z) + c'_(— z|+.z) =c' + (1.9a) 

(xj/'\ z) = c^(+z|-z) + c'_(-z|-z) = c'_ (1.9b) 

Thus the bra corresponding to the ket in (1.7) is 

{f \ = <i/'! 1 z)< 1 z : + (i// j — z) (—z| (1.10) 



7 Mathematicians call the linear vector space spanned by the bra vectors the dual space. 
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How are the ampijtudes (+z|t jr) and (if |+z) related? Just as we require that 
(+z|+z) = 1, we also require that (iff \ if) = 1. We are demanding that all physical 
vectors in our abstract quantum mechanical vector space be unit vectors. As we will 
now see, this requirement is crucial to the probabilistic interpretation of quantum 
mechanics. If we use (1.7) and (1.10) to evaluate (i/r|i/r), we find 

(f\f) = (f\+z)(+z\f) + (if |-z)(-z|i/r) = 1 (1.11) 

In Section 1.5 we will examine a final Stem-Gerlach experiment that will convince 
you that amplitudes such as (+z| if) and (— z\rjr) are in general complex numbers. 
The way to guarantee that equality (1.11) is satisfied for arbitrary \if)’s is to have 

(fl+z) = (+z\f)* and (i/f\-z) = (~z\f)* (1-12) 

so that each of the terms in (1.11) is real. These results say that the amplitude for a 
particle in the state | if) to be found in the states |±z) is the complex conjugate of 
the amplitude for a particle in the states |±z) to be found in the state y/}. 

From (1.6) and (1.9), we see that c' + = c* and c'_ — c*_. Therefore, the bra 
corresponding to the ket (1.5) is 

(if\=c* + (+z\+c*J-z\ (1.13) 

The bra vector is generated from the ket vector by changing all the basis kets to 
their corresponding bras and by changing all amplitudes (complex numbers) to their 
complex conjugates. 

With these results, we can express (1.11) as 

mf) = (+z\if)*(+z\if) + <-z| if)*(-z\if) 

= c*c + + c*c_ = 1 (1.14) 

or 

wm=\(+zm 2 + =1 ci.is) 

where |{+z|t/r)| 2 . (+z\tJ/)* (+z\is) and \(—z\i /)\ 2 = (—z\\j/)*{—ziO). We interpret 
\(+z\i /)\ 2 as the probability that a particle in the state \i/) will be found to be in 
the state j+z) if a measurement of S z is made with an SGz device and | (—z |;// > i 2 as 
the probability that the particle will be found in the state |—z). As (1.15) shows, the 
requirement that (i//jr//) = 1 guarantees that the probability of finding the particle in 
either one state or the other sums to one, since there are only two results possible 
for a measurement of S z for a spin-^ particle. 

The striking feature of (1.7) is that when both of the probability amplitudes 
(+z\ir) and (are nonzero, then a particle in the state h,//) is really in a 
superposition of the states |+z) and |—z). There are probabilities of obtaining both 
S : = h/2 and S : = —hf 2 if a measurement of S. is carried out. This is to be contrasted 
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with classical mechanics, where for a particle in a definite state we do not expect 
measurements of, say, the orbital angular momentum of the particle at a particular 
time to yield two different values, such as rj x p! and r 2 x p 2 . 




EXAMPLE 1.1 A measurement of S z is carried out on a particle in the state 

\f) ' 




What are the possible results of this measurement and with what probability 
do these results occur? 


SOLUTION Since 


and consequently 


(+z\f) = - 


| <+z| Vr>| 2 = ^ 

4 

therefore there is a 25 percent probability of obtaining S z = h/2. Similarly, 

iV3 


and 


<-z| f) = 


im|2 -i^3\ iV3\ 3 


therefore there is a 75 percent probability of obtaining S z = —h/2. Since the 
state | ifr) is appropriately “normalized,” namely 

(f\f) = \{+z\f)\ 2 + |{-z|^)| 2 = ] + ^ = 1 

4 4 

these probabilities must sum to one since the only results of a measurement 
of S : for a spin-! particle are h/2 and —h/2. 


1.4 Analysis of Experiment 3 


As we noted earlier. Experiment 3 is telling us that a particle in the state |+x) is in 
a superposition of the states |+z) and |—z) : |+x) = c + |+z) + c_|— z), since when 
we make measurements of S, with the last SGz device in the experiment, we have 
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probabilities ol obta|ning both h/2 and —h/ 2. Because the probabilities are each 
50 percent, we have 


C +C + = (+z|+x)*(+z|+x) = |<+z|+x)| 2 = \ (1.16a) 

C*_C_ = <-z|+x)*<-z|+x) = |(-z|+x)| 2 = \ (1.16b) 

One solution is to choose c + and c_ to be real, namely c + = l/\/2 and c_ = 1 / v'2. 
The more general solution for c + and c_ may be written as 


JS-L 


V2 


and 



(1.17) 


where <5 + and 5 are real phases that allow for the possibility that c + and c_ are 
complex. 8 The ket for the state with S x = h/2 is then given by 




l+x) = e —=;\ + z) + e —=r\-z) 
v2 V2 


(1.18) 


Notice that the probabilities (1.16) themselves do not give us any information about 
the values of the phases <5^ and <$_, since the phases cancel out when we calculate 
c* c + and c*_c_: 


c + c + 


c c = 


-iS: 




e iS + \ 

i 


V2J 

_ 2 

(1.19a) 

e' s - \ 

1 


7^) 

_ 2 

(1.19b) 


We can use these probabilities to calculate the average value, or expectation 
value, of S : , which is the sum of each value obtained by a measurement of S z 
multiplied by the probability of obtaining that value: 


<s z ) = <M-i + c: 



2 

2 




( 1 . 20 ) 


In this particular case, the expectation value doesn’t coincide with any of the values 
that may be obtained by measuring S z . An idealized set of data resulting from 


s A common way to express a complex number z is in the form z = x + iy, where x and 
y —the real and imaginary parts of z, respectively—give the location of z in the complex plane. 
Alternatively, we can express the coordinates for z in the complex plane using the magnitude r of 
the complex number and its phase <j >, where x = r cos <p and y = r sin <p. Then z = re where we 
have taken advantage of the Euler identity e'^ = cos 4> + > sin 4>. The complex conjugate of the 
complex number z — x + iy = re" t> is obtained by replacing i by — i, that is z* = x — iy — re~" t ’. 
Therefore. z*z = re^'^re 1 ^ = r - e ( ~"P+"P) = r - 
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# 


Figure 1.8 An idealized set of data result¬ 
ing from measurements of S z on a collection 
_f l /2 f t /2 Sz of particles with S x — ft/2. 


measurements of S, on a very large collection of particles, each with S x ~ h/2, 
is shown in Fig. 1.8. Clearly, there is an inherent uncertainty in the result of the 
measurements, since the measurements do not all yield the same value. We calculate 
this uncertainty by computing the standard deviation: we determine the average 
value of the data, take each data point, subtract the average value from it, square 
and average, and finally take the square root. Thus the square of the uncertainty is 
given by 

(AS,) 2 = <(S Z - (S z )) 2 ) 

= <S 2 - 2 S Z (S Z ) + (S z ) 2 ) 

= (S 2 ) - 2{S Z ){S,} + (S z ) 2 

= (S 2 ) - (S z ) 2 (1.21) 

The expectation value (Sr) is the sum of each value of S 2 multiplied by the proba¬ 
bility of obtaining that value: 



Therefore, substituting (1.20) and (1.22) into (1.21), we find AS, = h/2 for aparticle 
in the state | +x). We call A S z the uncertainty rather than the standard deviation since 
a single particle in the state |+x) does not have a definite value for S,. 9 

Of course, (S z ) = 0 is not in disagreement with finding a single particle to be 
spin up if we make a measurement of S z on a particle in the state |+x). To test 
predictions such as (1.20) requires a statistically significant sample. Suppose we 
make measurements of S z on 100 particles, each in the state |+x), and find 55 of 
them to be spin up ( S z = h/2) and 45 of them to be spin down (S z = —h/2). Should 
we be worried about a disagreement with the predictions of quantum mechanics? 


9 The experimental evidence for this assertion will be discussed in Section 5.5. 
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In general, if we make N measurements, we should expect fluctuations that are on 
the order of v77. Thus with 100 measurements, deviations from (S z ) = 0 on the 
order of 10 percent are reasonable. However, if we were to make 10 6 measurements 
and find 550,000 particles spin up and 450,000 particles spin down, we should be 
concerned, since we should expect fluctuations of only about \/N = 1,000, rather 
than the measured 50,000. 


EXAMPLE 1.2 As in Example 1.1, a spin-^ particle is in the state 

I f) = ^l+z> + 

What are the expectation value (S z ) and the uncertainty A S z for this state? 

SOLUTION 


{S z ) = |{+z|^)| 


2 / R \ , i j 


+ \(-z\f)\ 


“ 4 1.2/ + 4 l 2) ~ 4 


and 


(£?) = !(+z|^)| 

l (b 2 


Consequently 


A S z = 


+ 


2 

3 (h 1 


+ \{-z\f)\' 


(-8 


ft 2 

4 


yfiS 2 } ~(S Z ) 2 


The uncertainty AS, is 0.43/1 for the state |t//}, which is smaller than the 
value 0.50// for the state |+x), reflecting the fact that there is a 75 percent 
probability of obtaining fi/2 if a measurement of S z is carried out for the 
state \itf) as compared with 50 percent probability for the state |+x). Of 
course, if the state of the particle were |+z), then there would be a 100 
percent probability of obtaining ti/2 if a measurement of S z is carried out. 
Correspondingly, AS Z vanishes in this case. 
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1.5 Experiment 5 


We are now ready to consider the final Stern-Gerlach experiment of this chapter. 
In this experiment. Experiment 5. we replace the last SGz device of Experiment 3 
with one that has its inhomogeneous magnetic field in the y direction and thus 
make measurements of S y on particles exiting the SGx device in the state |+x). 
From Experiment 3 we already know the results of this final experiment. We must 
find 50 percent of the particles with S y = ft/2 and 50 percent of the particles with 
S v = —ft/2. Figure 1.9 shows the last two Stern-Gerlach devices in Experiment 3 
and in Experiment 5. Although we are measuring S v instead of S, with the last SG 
device in Experiment 5, the percentage of the particles that go "up” and “down” must 
be the same for Experiment 3 and Experiment 5, since the axis that we called the z 
axis in Experiment 3 could just as easily have been called the y axis, either by us or 
by another observer viewing the experiment. In fact, this sort of argument tells us 
that if we were to replace the SGx device in Experiment 3 with an SGy device, we 
would still find that 50 percent of the particles have S z = ft/2 and 50 percent have 
S z = — ft/2 when exiting the last SGz device. 

These simple results have important implications. Just as we are able to express 
the state |+x) by (1.18), we can express the state |+y) as a superposition of |+z) 
and |—z) in the form 


iy+ 


o'Y- 


l+,)= vf l+z) + 7f 


|-z) = e —~= [|Tz> + e ,iy K+) |-z) 

*/? L 


(1.23) 


where we have written the complex numbers multiplying the kets |+z) and ]— z) in 
such a way as to ensure that there is a 50 percent probability of obtaining S z = ft/2 
and a 50 percent probability of obtaining S z = —ft/2. Note that in the last step we 
pulled out in front an overall phase factor e ,y + for future computational convenience. 
Moreover, since in Experiment 5 there is a 50 percent probability of finding a particle 


SGx 


S z = hl 2 


No 


SGz 


s 7 =m 


S, = -h!2 


No/2 
Nq /2 


(a) 



No/2 

No/2 


(b) 


Figure 1.9 Block diagrams showing the last two SG 
devices in (a) Experiment 3 and in (b) Experiment 5. 
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with S v = h/2 when jt exits the SGx device in the state |+x), we must have 

K+y|+x)l 2 = ^ 0-24) 

Now the bra corresponding to the ket (1.23) is 


<+y| 


e~ ,y + , , e 

-<+z| + 


-i Y- 


<y+ 


V2 


42 


-z = 


42 L 


<+z| + e~ iiY -~ Y ^(-z\\ (1.25) 


where we have replaced the complex numbers in (1.23) with their complex conju¬ 
gates in going from (1.23) to (1.25). If we rewrite (1.18) by pulling out an overall 
phase factor: 


|+x> = € — [|+z) + e i(S - S+) \-z) 


(1.26) 


then 


(+y|+x) = -—'- (<+z| + e ,y (-z\) (j+z) + <? ,<5 |-z}) 


2 

e i(,S + -y + ) 


1 + e‘ 


i(S-y) 


(1.27) 


where 8 = S_ - 5 + and y = y_ — y + are the relative phases between the kets 
|+z) and |—z) for these two states, and we have used (+z|+z) = {—z|—z) = 1 
and (+z|-z) = {—z|+z) =0 in evaluating the amplitude. We finally calculate the 
probability: 


l(+y|+x)r = 


„i(& + -Y+) 


[l + e dS-K>]J| 

- [l + e i(4_y) ] [l + e~ i(S - y) ] 


-i(i+-y+) 


1 -f- £ 


-i(S-y) 


2 

2 


[l + cos(<$ - y)] 


(1.28) 


Agreement with (1.24) requires 8 - y — ±tc/2. The common convention, which we 
will see in Chapter 3, is to take 8 = 0. If in (1.23) and (1.26) we ignore the overall 
phases 5 + and y + , which appear in the amplitude (1.27) but do not enter into the 
calculation of the probability (1.28), we see that 


and 


l+x) — —=|+z) H —\ 
V2 v2 


-z) 


(1.29) 


1 e in ! 2 

l +, ) = -^ | + z ) + -^ 1 


-z) 


42 


l+z) 


l 

7? 


(1.30) 
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z 




Figure 1.10 A state that is spin down along v in 
the right-handed coordinate system shown in (a) is 
spin up along y in the left-handed system shown 
in (b). 


where we have chosen y = tt/ 2. The choice y — —it/2 yields the state 


72 


l+z} 


l 

71 


l-z) = l-y) 


(1.31) 


The reason for this ambiguity is that in discussing our series of Stern-Gerlach 
experiments we have not specified whether our coordinate system is right handed 
or left handed. The state we have called |+y) is indeed the state with S v = h/2 in 
a right-handed coordinate system. The state we have called |—y) is the state with 
S y = —h/2 in our right-handed coordinate system. Of course, this latter state, which 
is spin down along y, is spin up along y in a left-handed coordinate system, as shown 
in Fig. 1.10. That is why we see both solutions appearing. 10 

These complications should not detract from the main message to be learned 
from Experiment 5. The simple fact is that (1.24) cannot be explained without a 
complex amplitude. The appearance of i’s such as the one in (1.30) is one of the key 
ingredients of a description of nature by quantum mechanics. Whereas in classical 
physics we often use complex numbers as an aid to do calculations, there they are 
not essential. The straightforward Stern-Gerlach experiments we have outlined in 
this chapter demand complex numbers for their explanation. 






EXAMPLE 1.3 A spin-4 particle is in the state 


\f) = \\+i) + “l~Z> 


10 We will see how to derive all of the results of this section front first principles in Chapter 3. 
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What is the probability that a measurement of S y yields ft/2? What is (S ) 
for this state? 

SOLUTION From (1.30), we know that 


[+y> = 4=1+.) + 4,1-z) 

v2 sfl 


Thus the coaesponding bra vector is 


1 


<+yl = -^ <+z l 

V2 




The probability amplitude for finding a particle in the state |t/r) with S\, = ft/2 
if a measurement of S v is carried out is given by 


(+yW = (^(+,1 - ^(-x[) (?!«) + ^|_ 2) ) 

Therefore the probability is given by 

1 

l(+y|^)l 2 = - + “ = °- 93 

To get a physical feel for what the spin state ji/r) is and why the probability 
of finding the particle in this state with S y = ft /2 is as large as 0.93, take a 
look at Problem 1.10. 

Since a measurement of S v yields either +h/2 or -ft/2, the probability 
of obtaining S y = —ft/2 is given by 

K-yWl 2 - 1 - K+ylVOI 2 = = o.oi 


Therefore 



1.6 Summary 


The world of quantum mechanics is both strange and wonderful, in part because it is 
a world filled with surprises that so often am counter to our classical expectations. 
Yet as we go on, we will see the remarkable insight quantum mechanics gives us 


Page 37 (metric system) 




22 I 1. Stern-Gerlach Experiments 


not just into microscopic phenomena but into the laws of classical mechanics as 
well. Since quantum mechanics subsumes classical mechanics, we cannot “derive” 
quantum mechanics from our classical, macroscopic experiences. Our strategy in 
this chapter has been to take a number of Stern-Gerlach experiments as our guide 
into this strange world of quantum behavior. From these experiments we can see 
many of the general features of quantum mechanics. 

A quantum state is specified either by a ket vector |i (r) or a corresponding bra 
vector {xfr\. The complex numbers that we calculate in quantum mechanics result 
from a ket vector |i jt) meeting up with a bra vector (<p\, forming the bra(c)ket (<p\f), 
which we call the probability amplitude for a particle in the state |t/r) to be found in 
the state \(p). The amplitude (t/r |<p) for a particle in the state \<p) to be found in the 
state \ip) is the complex conjugate of the amplitude for a particle in the state \xj/) to 
be found in the state | <p): 


(f\ <p) = (<p\f)* (1-32) 

The probability of finding a particle to be in the state | (p) when a measurement is 
made on a particle in the state \\[r) is given by \{(p\ifr)\ 2 . Notice that the probability is 
unchanged if the ket |i j/) is multiplied by an overall phase factor e lS : | i//} —* e iS \x/r). 

Although we have phrased our discussion so far solely in terms of the intrinsic 
spin angular momentum of a spin -4 particle, the structure that we see emerging has 
a broad level of applicability. Suppose that we are considering an observable A 
for which the results of a measurement take on the discrete values a h a 2 , ay . .. . !1 
As we will see, angular momentum and energy are good examples of observables 
for which the results of measurements can be grouped in a discrete (although not 
necessarily finite) set. A general quantum state, expressed in the form of a ket vector 
\ilr), can be written as a superposition of the states |a t ), \a 2 ), Iab), • ■ • that result if 
a measurement of A yields aj, a 2 , (iy . . ., respectively: 

W) — c \\ a \) + c l\ a l) + T 3 IO 3 ) + • ■ • = ^2 c n\ a n) (1.33) 

n 

The corresponding bra vector is given by 

(xff \ =c*(a,| + c* 2 (a 2 \ + c l{ a -i\ + • • • = r c*(a n [ (1-34) 

n 

The complex number 

G, = <«n \f) (1-35) 


11 The extension to observables such as position and momentum where the values form a 
continuum is discussed in Chapter 6. 
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is the amplitude to dbtain a„ if a measurement of A is made for a particle in the 
state | if). 12 

Physically, we expect that 


( a Mj) = 0 i£j (1.36) 

since if the particle is in a state for which the result of a measurement is a •, there is 
zero amplitude of obtaining <% with i A j. The vectors \a f ) and \a/} with i ^ j are 
said to be orthogonal. The amplitude to obtain a ; for a particle in the state \a,) is 
taken to be one, that is. 


= 1 (1.37) 

The vector | a t ) is then said to be normalized. Equations (1.36) and (1.37) can be 
nicely summarized by 


(1.38) 

where <5, ; - is called the Kronecker delta defined by the relationship 



0 i ¥= J 
1 i = j 


(1.39) 


We say that the set of vectors |a,-} form an orthonormal set of basis vectors. 
Equation (1.33) shows how an arbitrary vector \i(/} can be expressed in terms of 
this basis set. Thus the vectors | a t ) form a complete set. 

Amplitudes such as (1.35) can be projected out of the ket | \j /) by taking the inner 
product of the ket |i//) with the bra {«, j: 


■i- > 

n 

^ ^n^in (1.40) 

n 

Thus the ket (1.33) can be written 

l^> = \ a n)( a n\1') (1.41) 

n 

which is just a sum of ket vectors |a ; ), each multiplied by the amplitude (a, \ f ). 


12 In this chapter we have used the shorthand notation |S, = ±ft/2) = |±z), |S* = ±h/ 2) =. 
I Ax)., and so on. Thus (±z\f) are the amplitudes to obtain S z = ±h/2 for a spin-4 particle in the 
state {ifr} if a measurement of S : is made. 
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Similarly, the amplitude c* can be projected out of the bra {xfr by taking the inner 
product with ket \a t )\ 


(fWi) = '%2c* n (a n \a i ) 



n 


The bra (1.34) can thus be written as 

w = Y 1 ^kh«kI 

n 


(1.42) 


(1.43) 


which is the sum of the bra vectors (a ; |, each multiplied by the amplitude (V'K)- 
The normalization requirement 


W\t) - 


for a physical state | xjr) leads to 


1 = {f\f) = c * <«/l j c j\ a j) 

=EE^w 

i j 

= EE4'A-E is I 2 

' j 1 

showing that the probabilities 

\c i \ 2 = \{a i m 2 


(1.44) 


(1.45) 


(1.46) 


of obtaining the result«, if a measurement of A is carried out sum to one. From these 
results it follows that the average value of the observable A for a particle in the state 
\\jr) is given by 

(A) = J2\cfa n (1.47) 

n 

since the average value (expectation value) is the sum of the values obtained by 
the measurements weighted by the probabilities of obtaining those values. The 
uncertainty is given by 

AA = yfl(A - (A)) 2 ) = /(A 2 ) - {A} 2 (1.48) 


where 

(A 2 } = X> m |V (1 49) 

n 
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Equations (1.47) and (1.49) illustrate the importance of completeness, that is, that 
any state can be expressed as a superposition of basis vectors, as in (1.33). Without 
this completeness, we would not know how to calculate the results of measurements 
for the observable A for an arbitrary state. 

One of the most striking features of the physical world is that if more than one 
of the c n in (1.33) is nonzero, then there are amplitudes to obtain different a n for a 
particle in a particular state 1 \jr }. How should we interpret this result: Is the ket (1.33) 
telling us that the particle spends time in each of the states \a n ), and the probability 
\(a n \\j/}\ 2 is just a reflection of how much time it spends in that particular state? Does 
this specification of the state as a superposition just reflect our lack of knowledge 
of which state the particle is really in? Is this why we must deal with probabilities? 
The answer to these questions is an emphatic no. Rather, (1.33) is to be read as a 
true superposition of the individual states \a n ), for if we parametrize the complex 
amplitudes in the form 


(a n W) = \(a n \f)\e iS » (1.50) 

where \{a n \ijr)\ is the magnitude, or modulus, of the amplitude and S n is the phase 
of the amplitude, the difference in phase (the relative phase) between the individual 
states in the superposition matters a great deal. As we have seen in our discussion 
of the spin-:j |+x) and |+y) kets, changing the relative phase between the kets |+z) 
and j—z) in such a superposition by tt/2 changes a state with S x = h/2 into one 
with S y = h/2. Compare (1.29) and (1.30). 13 Thus the values of the relative phases 
in (1.33) dramatically affect how the states “add up,” or how the amplitudes interfere 
with each other. Quantum mechanics is more than just a collection of probabilities. 
We live in a world in which the allowed states of a particle include superpositions of 
the states in which the particle possesses a definite attribute, such as the z component 
of the particle’s spin angular momentum, and thus by superposing such states we 
form states for which the particle does not have definite value at all for such an 
attribute. , 

Problems 


1.1. Determine the field gradient of a 50-cm-long Stem-Gerlach magnet that would 
produce a 1-mm separation at the detector between spin-up and spin-down silver 
atoms that are emitted from an oven at T = 1500 K. Assume the detector (see 
Fig. 1.1) is located 50 cm from the magnet. Note: While the atoms in the oven have 
average kinetic energy 3 k B T /2, the more energetic atoms strike the hole in the oven 
more frequently. Thus the emitted atoms have average kinetic energy 2k B T, where 


13 This also shows that a spin-j particle cannot have simultaneously a definite value for the x 
and v components of its intrinsic spin angular momentum. 
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z 



Figure 1.11 The angles 0 and <j> specifying the orientation of 
an SGn device. 


kn is the Boltzmann constant. The magnetic dipole moment of the silver atom is due 
to the intrinsic spin of the single electron. Appendix F gives the numerical value of 
the Bohr magneton, eh/2m e c, in a convenient form. 


1.2. Show for a solid spherical hall of mass m rotating about an axis through its center 
with a charge q uniformly distributed on the surface of the ball that the magnetic 
moment fi is related to the angular momentum L by the relation 


!*■ = 



6m c 


Reminder: The factor of c is a consequence of our using Gaussian units. If you work 
in SI units, just add the c in by hand to compare with this result. 


1.3. In Problem 3.2 we will see that the state of a spin-^ particle that is spin up along 
the axis whose direction is specified by the unit vector 


n = sin 0 cos 0i + sin 6 sin 0j + cos #k 


with 6 and <j> shown in Fig. 1.11, is given by 

0 6 
i+n) = cos - |+z) + sin — |—z> 

(a) Verify that the state |+n) reduces to the states |+x) and |+y) given in this 
chapter for the appropriate choice of the angles 0 and 0 . 

(b) Suppose that a measurement of S, is carried out on a particle in the state |+n>. 
What is the probability that the measurement yields (i) h/ 2? (ii) —h/ 2? 

(c) Determine the uncertainty A S z of your measurements. 

1.4. Repeat the calculations of Problem 1.3 (b) and (c) for measurements of S x . 
Hint: Infer what the probability of obtaining — h/2 for S x is from the probability of 
obtaining h/2. 


1.5. 

(a) What is the amplitude to find a particle that is in the state |+n) (from Prob¬ 
lem 1.3) with S y = h/2? What is the probability? Check your result by eval¬ 
uating the probability for an appropriate choice of the angles 6 and 0 . 
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Figure 1,12 A Stem-Gerlach experiment with spin-j particles. 


(b) What is the amplitude to find a particle that is in the state |+y) with S n = h/ 2? 
What is the probability? 

1.6. Show that the state 

|—n) = sin — | —j—z) - e'* cos -|-z) 

2 2 


satisfies (+n[—n) =0, where the state |+n) is given in Problem 1.3. Verify that 
(—n|—n) = 1. 

1.7. A beam of spin-| particles is sent through a series of three Stem-Gerlach 
measuring devices, as illustrated in Fig. 1.12. The first SGz device transmits particles 
with S z = h/2 and filters out particles with S z — —h/2. The second device, an SGn 
device, transmits particles with S n = /i/2 and filters out particles with S n = -h/2, 
where the axis n makes an angle 0 in the x-z plane with respect to the z axis. 
Thus particles after passage through this SGn device are in the state |+n> given 
in Problem 1.3 with the angle <j) — 0. A last SGz device transmits particles with 
S z = —h/2 and filters out particles with S, = h/2. 

(a) What fraction of the particles transmitted by the first SGz device will survive 
the third measurement? 

(b) How must the angle 9 of the SGn device be oriented so as to maximize the 
number of particles that are transmitted by the final SGz device? What fraction 
of the particles survive the third measurement for this value off/? 

(c) What fraction of the particles survive the last measurement if the SGn device 
is simply removed from the experiment? 

1 . 8 . The state of a spin-| particle is given by 

w => 2)+ 

What are (S z ) and AS, for this state? Suppose that an experiment is carried out on 
100 particles, each of which is in this state. Make up a reasonable set of data for S z 
that could result from such an experiment. What if the measurements were carried 
out on 1,000 particles? What about 10,000? 

1.9. Verify that A S x = Jisf) - { S x ) 2 = 0 for the state |+x). 
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1.10. The state 

1 iV 3 

\f) = ~!+ z > + — l- z ) 

is a state with S n — h/2 along a particular axis n. Compare the state /p) with the state 
|+n) in Problem 1.3 to find n. Determine {S x ), (S y ), and { S z ) for this state. Note: 
(S z ) and (S y ) for this state are given in Example 1.2 and Example 1.3, respectively. 

1 . 11 . Calculate (S x ), (S y ), and (S z ) for the state 

W = -~\+z) + -^l-z) 

Compare your results with those from Problem 1.10. What can you conclude about 
these two states? 

1.12. The state 

I is) - ^l+ z > + ~ l _z > 

is similar to the one given in Problem 1.10. It is just “missing” the /. By comparing 
the state with the state |+n) given in Problem 1.3, determine along which direction 
n the state is spin up. Calculate (S x ), (S y ), and (S 2 ) for the state \xj/). Compare your 
results with those of Problem 1.10. 

1.13. Show that neither the probability of obtaining the result a, nor the expectation 
value (A) is affected by | if) -> e iS \\p), that is, by an overall phase change for the 
state | ir). 

1.14. It is known that there is a 36% probability of obtaining S, — h/2 and therefore 
a 64% chance of obtaining S z = - H/2 if a measurement of S z is carried out on a 
spin -\ particle. In addition, it is known that the probability of finding the particle 
with S x = h/2, that is in the state |+x), is 50%. Determine the state of the particle 
as completely as possible from this information. 

1.15. It is known that there is a 90% probability of obtaining S z = h/2 if a measure¬ 
ment of S z is carried out on a spin-| particle. In addition, it is known that there is a 
20% probability of obtaining S y — h/2 if a measurement of S y is carried out. Deter¬ 
mine the spin state of the particle as completely as possible from this information. 
What is the probability of obtaining S x = h/2 if a measurement of S x is carried out? 
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CHAPTER 2 


Rotation of Basis States 
and Matrix Mechanics 


In this chapter we will see that transforming a vector into a different vector in our 
quantum mechanical vector space requires an operator. We will also introduce a con¬ 
venient shorthand notation in which we represent ket vectors by column vectors, bra 
vectors by row vectors, and operators by matrices. Our discussion will be primarily 
phrased in terms of the two-state spin-^ system introduced in Chapter 1, but we will 
also analyze another two-state system, the polarization of the electromagnetic field. 

2.1 The Beginnings of Matrix Mechanics 


REPRESENTING KETS AND BRAS 

We have seen that we can express an arbitrary spin state | VO of a spin-^ particle as 


I VO = !+z)<+z|V0 + te-z)(-zlV0 =c+!+z> +c_i-z) (2.1) 


Such a spin state may, for example, be created by sending spin-j particles through 
a Stem-Gerlach device with its magnetic field gradient oriented in some arbitrary 
direction. The complex numbers c± = (±z|V0 tell us how our state IVO ' s oriented 
in our quantum mechanical vector space, that is, how much of |V0 is projected onto 
each of the states j+z) and |— z). 

One convenient way of representing |V0 is just to keep track of these complex 
numbers. Just as we can avoid unit vectors in writing the classical electric field 


E = E x i + E y j + £ 2 k (2.2a) 

by using the notation 

E (E x , E y , E z ) (2.2b) 


29 
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we can represent the ket (2.1) by the column vector 

_ ^n+z\f}\ /c 4 

s z basis \ (— z\\ff) 


\f) 


In this basis, the ket |+z) is represented by the column vector 


l+z) 


-z|+z) 


5, basis \ {—z|-|~z} 

and the ket |— z) is represented by the column vector 


M) 


S z basis 


<+z|-z) 

(-z|-z) 


(2.3) 


(2.4) 


(2.5) 


although the label under the arrow is really superfluous in (2.4) and (2.5) given the 
form of the column vectors on the right. Using (1.29), we can also write, for example. 


l+x) 


S- basis 


(+Z|+x) 

<-z|+x) 


-If 1 
V2 Vl 


( 2 . 6 ) 


How do we represent bra vectors? We know that the bra vector corresponding to 
the ket vector (2.1) is 


(f I = W+z)(+z| + w-z)(-z| =C*(+Z| + c* (~z| 
We can express 

(f \f) = (V'-l+zX+zIVr) + {f\-z)(-z\f) = 1 

conveniently as 

<+z|^> 


Wf) = ((Vr|+z), (ir\-z)) 


bra vector 


(—z|r/r) 


= 1 


(2.7) 


( 2 . 8 ) 


(2.9) 


ket vector 


where we are using the usual rales of matrix multiplication for row and column 
vectors. This suggests that we represent the bra {x[r\ by the row vector 


(f\ -> (W+z). (^Al-z>) (2.10) 

S z basis 

Since (t^|+z) = (+z|t f)* and (i jr\—z) = (—z|^)*, (2.10) can also be expressed as 
(f\ -► «+*W, (~z\f)*) = (C* C*J (2.1 1) 

S z basis 

Comparing (2.11) with (2.3), we see that the row vector that represents the bra 
is the complex conjugate and transpose of the column vector that represents the 
corresponding ket. In this representation, an inner product such as (2.9) is carried 
out using the usual rales of matrix multiplication. 


Page 46 (metric system) 



2.1 The Beginnings of Matrix Mechanics I 31 


As an example, Ve may determine the representation for the ket |— x) in the S z 
basis. We know from the Stern-Gerlach experiments that there is zero amplitude 
to obtain S x = —ft/2 for a state with S x = ft/2, that is, {—x|+x) = 0. Making the 
amplitude (—x|+x) vanish requires that 


l-x> 



S z basis y/2 



since then 


-iS 


(-x|+x) = rr(l, —D— 7 = 
V2 v2 



( 2 . 12 ) 


(2.13) 


Note that the 1/V2 in front of the column vector in (2.12) has been chosen so that 
the ket |—x) is properly normalized: 

I’.^ i 1 \ 

<-*!-*> = ^<W>^ (_,)-■ 

The common convention, and the one that we will generally follow, is to choose the 
overall phase 8 = 0 so that 


1 / 1 


-x 


Sz basis 2 \ — 1 


(2.15) 


However, in Section 2.5 we will see that an interesting case can be made for choosing 
S = n. 

As another example, (1.30) indicates that the state with S v = ft/2 is 


l+y> = —7=l+z> + 

V2 V2 


(2.16) 


which may be represented in the S ,"basis by 


1 /I 


l+y) ' r- 

V2 \i 

The bra corresponding to this ket is represented in the same basis by 

<+yl \ a -0 


(2.17a) 


(2.17b) 


Note the appearance of the — i in this representation for the bra vector. Using these 
representations, we can check that 


1 


< + y'+y > = 7 =a-0^ 


i /i 


= l 


(2.18) 


Page 47 (metric system) 



32 | 2. Rotation of Basis States and Matrix Mechanics 


If we had used the row vector 


y/2 (1 ’ +0 

in evaluating the inner product, we would have obtained zero instead of one. Since 
<-y|+y} = 0, this tells us that in the S z basis 


and thus 


<—yl 


sTi 


( 1 . +0 


(2.19a) 


-y) 


l 

7! 



(2.19b) 


Putting these pieces together, we can use these matrix representations to calculate 
the probability that a spin-1 particle with S x = h/2 is found to have S v = ft,/2 when 
a measurement is carried out: 


|(+y|+x)| 2 = 


7! 


a -o 


i 

71 


2 


1- / 2 _ (1-i) (1 + /) _ 1 

2 2 2 ~ 2 


( 2 . 20 ) 


EXAMPLE 2.1 Use matrix mechanics to determine the probability that a 
measurement of S y yields h/2 for a spin-4 particle in the state 

!V0 = ^I+Z) + “l-z) 

SOLUTION 


!(+y!^)l 2 = 





2 


—7(1 + -s/3) 
2V2 


i ( 4 + 2V3)=i + 


V5 

4 


Compare this relatively compact derivation with the use of kets and bras in 
Example 1.3. 


FREEDOM OF REPRESENTATION 

It is often convenient to use a number of different basis sets to express a particular 
state \f). Just as we can write the electric field in a particular coordinate system as 
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(2.2), we could use a different coordinate system with unit vectors f, j', and k' to 

% 

write the same electric field as 


E — E x \ 4- E x ! \ d - E z !k 


(2.21a) 


or 


E —> (E x f, E yf , E z >) (2.21b) 

Of course, the electric field E hasn’t changed. It still has the same magnitude and 
direction, but we have chosen a different set of unit vectors, or basis vectors, to 
express it. Similarly, we can take the quantum state \ ^r) in (2.1) and write it in terms 
of the basis states |+x) and |— x) as 


I f) = [+x)(+x|V/> + -x}(—X|t/n (2.22) 


which expresses the state as a superposition of the states with S x — ±h/2 multiplied 
by the amplitudes for the particle to be found in these states. We can then consttuct 
a column vector representing |i//) in this basis using these amplitudes: 


m —> ( (+xh/ ' ) ) 

S x basis \ (—X|l/r) ) 

Thus the column vector representing the ket |+x) is 


l+x)-> 

S x basis 


<+x|+x) 

|-x|+x) 



(2.23) 


(2.24) 


which is to be compared with the column vector (2.6). The ket |+x) is the same state 
in the two cases; we have just written it out using the S~ basis in the first case and the 
, S x basis in the second case. Which basis we use is determined by what is convenient, 
such as whal measurements we are going to perform on the state |+x). 


2.2 Rotation Operators 


There is a nice physical way to transform the kets themselves from one basis set 
to another. 1 Recall that within classical physics a magnetic moment placed in a 
uniform magnetic field precesses about the direction of the field. When we discuss 
time evolution in Chapter 4, we will see that the interaction of the magnetic moment 
of a spin-4 particle with the magnetic field also causes the quantum spin slate of the 
particle to rotate about the direction of the field as time progresses. In particular, if 


1 You may object to calling anything dealing directly with kets physical since ket vectors are 
abstract vectors specifying the quantum state of the system and involve, as we have seen, complex 
numbers. 
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the magnetic field points in the y direction and the particle is initially in the state 
|+z), the spin will rotate in the x-z plane. At some later time the particle will be 
in the state |+x). With this example in mind, it is useful at this stage to introduce a 
rotation operator R(yj) that acts on the ket |+z), a state that is spin up along the 
z axis, and transforms it into the ket |+x), a state that is spin up along the x axis: 

|+x) = fl(§j)|+z> (2.25) 

Changing or transforming a ket in our vector space into a different ket requires an 
operator. To distinguish operators from ordinary numbers, we denote all operators 
with a hat. 

What is the nature of the transformation effected by the operator (-f j) ? This 
operator just rotates the ket |+z) by n/2 radians, or 90°, about the y axis (indicated 
by the unit vector j) in a counterclockwise direction as viewed from the positive 
y axis, turning, or rotating, it into the ket |+x), as indicated in Fig. 2.1a. The same 
rotation operator should rotate |—z) into |— x). In fact, since the most general state 
of a spin-1 particle may be expressed in the form of (2.1), the operator rotates this 
ket as well: 

#(fj)l>A} = j) (c+l+z) + C_|-z>) 

= c + /?(fj)|+z)+c_ J R(fj)|—z) 

= c + |+x) + c_|-x) (2.26) 

Note that the operator acts on kets. not on the complex numbers. 2 

THE ADJOINT OPERATOR 

What is the bra equation corresponding to the ket equation (2.25)? You may be 
tempted to guess that (+x| = (+z|/?(|-j), but we can quickly see that this cannot be 
correct, for if it were, we could calculate 3 

(+x|+x) = [<+z|fl(fj)] [*(§j)|+z>] = (+z|/?(f j)/?(fj)|+z> 

We know that (+x|+x) = 1, but since /Op) rotates by 90° around the y axis, 
R(j j)/?(§ j) = /Op performs a rotation of 180° about the y axis. But as indicated 


2 An operator A satisfying 


A(a\f) +b\<p)) =aA\f) + bA\(p) 

where a and b are complex numbers, is referred to as a linear operator. 

3 You can see why we position the operator to the right of the bra vector when we go to calculate 
an amplitude. Otherwise we would evaluate the inner product and the operator would be left alone 
with no vector to act on. 
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Figure 2.1 Rotating |+z) counterclockwise about the y axis 
(a) by tc/2 radians transforms the state into |+x) and (b) by 
7 x radians transforms the state into | — z). The spin state of a 
spin-^ particle with a magnetic moment would rotate in the 
x-z plane if the particle were placed in a magnetic field in the 
y direction. 


in Fig. 2.1b, iR(jrj)|4-z) = |—z), and since (+z|J?(7rj)|+z) = {+z|—z) = 0, we are 
left with a contradiction. 

For the ket vector |t (r) = c + |+z) + c_|—z), the corresponding bra vector is 
(\[r\ = c* (+z| + c* {-z|, with the complex numbers in the ket turning into their 
complex conjugates in the bra. Since we are dealing here with operators and not 
just complex numbers, we need an additional rule for determining the bra equation 
corresponding to a ket equation like (2.25) that involves an operator. We introduce 
a new operator R^, called the adjoint operator of the operator R, so that the bra 
equation corresponding to (2.25) is 

<+x| = (+z|tf t (fj) (2.27) 


We can then satisfy 

1 = <+x|+x) = (+z^ + (fj)/?(fj)|+z) = (+z|+z> (2.28) 

if the adjoint operator R ' is inverse of the operator R. In particular, the adjoint 
operator is a rotation operator that can be viewed as operating to the right on 

the ket f?(fj)l+z). If #(|j) rotates by 90° counterclockwise, then R' (|j) rotates 
by 90° clockwise so that I? + (f j)7?(f j) = 1, and we are left with (+z|+z) = l. 4 

In general, an operator U satisfying U^U = 1 is called a unitary operator. 
Thus the rotation operator must be unitary in order that the amplitude for a state 
to be itself—that is, so that = 1—doesn’t change under rotation. Otherwise, 

probability would not be conserved under rotation. 


4 As this example illustrates, the adjoint operator can act to the right on ket vectors as well to 
the left on bra vectors. 
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(a) (b) 


Figure 2.2 (a) Rotating |+x) by jt/2 radians coun¬ 
terclockwise about the z axis transforms the state into 
|+y). (b) Rotation of a state by an infinitesimal angle 
d4> about the z axis. 

THE GENERATOR OF ROTATIONS 

Instead of performing rotations about the y axis, let's rotate about the z axis. If we 
rotate by 90° counterclockwise about the z axis, we will, for example, turn |+x) into 
|+y), as indicated in Fig. 2.2a. Instead of carrying out this whole rotation initially, 
let us first focus on an infinitesimal rotation by an angle d<j> about the z axis, as 
shown in Fig. 2.2b. A useful way to express this infinitesimal rotation operator is in 
the form 

R(d<pk) = l- -j z d<(> (2.29) 

h 

where we have introduced an operator J z that “generates” rotations about the z axis 
and moves us away from the identity element. Our form for R(d(pk) clearly satisfies 
the requirement that R(d4> k) —x 1 as dcj> -> 0. As we will see, the factor of i and 
the factor of h have been introduced to bring out the physical significance of the 
operator J z . In particular, because the factor of h occurs in the denominator of 
the second term in (2.29), the operator J z must have the dimensions of fi, namely, 
the dimensions of angular momentum. We will see that a convincing case can be 
made that we should identify this operator the generator of rotations about the 
z axis, with the z component of the intrinsic spin angular momentum of the particle. 

We first establish that J z belongs to a special class of operators known as Hermi- 
tian operators. Physically, the operator R\d(j) k) is the inverse of the rotation operator 
R(d(pk). By taking the adjoint of (2.29), we can write this operator in the form 

R ] '(d<t>k) = 1 + (2.30) 

where 7 j is the adjoint of the operator J z . Note that since the bra corresponding to the 
ket c\\js) is (rj/ |c*, complex numbers get replaced by their complex conjugates when 
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forming the adjointjpperator. Thus i —► —/ in going from (2.29) to (2.30), which 
has the same effect as changing d(p to -d(j), and therefore R^idcpk) — R(-dcp k), 
provided J' z — J z . More formally, since the rotation operator R^(d(p k) is the inverse 
of the rotation operator R{d<pk), these operators must satisfy the condition 


R f (dcpk)R(d(pk) = (l - jJ z d$j 

= 1 + ~ (./.J — 7 Z ) d<p + 0(d(p 2 ) = 1 (2.31) 

Since the angle dtp is infinitesimal, we can neglect the second-order terms in d<p> and 
(2.31) will be satisfied only if J z = . In general, an operator that is equal to its 

adjoint is called self-adjoint, or Hermitian. Thus ./, must be a Hermitian operator. 
Hermitian operators have a number of nice properties that permit them to play major 
roles in quantum mechanics. After some specific examples, we will discuss some of 
these general properties in Section 2.8 . 5 

One of the reasons that infinitesimal rotations are useful is that once we know 
how to perform an infinitesimal rotation about the z axis by an angle d(p, we can 
carry out a rotation by any finite angle <p by compounding an infinite number of 
infinitesimal rotations with 

dtp = lim — 

N-*o o N 


The rotation operator R(<pk) is then given by 


R(<pk) = lim 

N-+CO 


l- J, 


h z \N 


<P 




(2.32) 


The last identity in (2.32) can be established by expanding both sides in a Taylor 
series and showing that they agree term by term (see Problem 2.1). In fact, a 
series expansion is really the only way to make sense of an expression such as an 
exponential of an operator. 


EIGENSTATES AND EIGENVALUES 

What happens to a ket |+z) if we rotate it about the z axis—that is, what is 
R((p k)|+z)? If you were to rotate a classical spinning top about its axis of rota¬ 
tion, it would still be in the same state with its angular momentum pointing in the 
same direction. Similarly, rotating a state of a spin -5 particle that is spin up along 
z about the z axis should still yield a state that is spin up along z, as illustrated in 


5 Now you can see one reason for introducing the i in the defining relation (2.29) for an 
infinitesimal rotation operator. Without it, the generator /, would not have turned out to be 
Hermitian. 
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Figure 2.3 (a) Rotating |+z) by angle 0 about the 
z axis with the operator R(4> k) does not change the 
state, in contrast to the action of the operator R(9j), 
which rotates |+z) by angle 0 about the y axis, 
producing a different state, as indicated in (b). 


Fig. 2.3. In Chapter 1 we saw that the overall phase of a state does not enter into the 
calculation of probabilities, such as in (1.24). This turns out to be quite a general 
feature: two states that differ only by an overall phase are really the same state. We 
will now show that in order for ft(0k)|+z) to differ from |+z) only by an overall 
phase, it is necessary that 


J z \+ z) = (constant) |+z) 


(2.33) 


In general, when an operator acting on a state yields a constant times the state, we call 
the state an eigenstate of the operator and the constant the corresponding eigenvalue. 

First we will establish the eigenstate condition (2.33). If we expand the exponen¬ 
tial in the rotation operator (2.32) in a Taylor series, we have 


tf(0k)|+z) = 



i<pJ z 


H- 

2! 



(2.34) 


If (2.33) is not satisfied and J z |+z) is something other than a constant times |+z), 
such as |+x), the first two terms in the series will yield |+z) plus a term involving 
|+x), which would mean that R (0k) |+z) differs from |+z) by other than a mul¬ 
tiplicative constant. Note that other terms in the series cannot cancel this unwanted 
|+x) term, since each term involving a different power of 0 is linearly independent 
from the rest. Thus we deduce that the ket |4-z) must be an eigenstate, or eigenket, 
of the operator J z . 

Let’s now turn our attention to the value of the constant, the eigenvalue, in (2.33). 
We will give a self-consistency argument to show that we will have agreement with 


Page 54 (metric system) 



2.2 Rotation Operators I 39 


the analysis of the Stern-Gerlach experiments in Chapter I provided 

J z \± a) = ±||±as) (2.35) 

This equation asserts that the eigenvalues for the spin-up and spin-down states are the 
values of S z that these states are observed to have in the Stern-Gerlach experiments. 6 
First consider the spin-up state. If 

J z [+z) = ^|+z> (2.36a) 


then 


J- ir- 

irl+z) = j z — i+z) = —J z i+z) = 



(2.36b) 


and so on. From (2.34), we obtain 


R(<t>k)\+z,) 


i± (ijP 

2 2! V 2 


2 

+ * * * 


! I z) = e 10/2 |+z) 


(2.37) 


The state has picked up an overall phase, just as we would hope if the state is not to 
change. The value of the phase is determined by the eigenvalue in (2.36a). 

In order to see why the eigenvalue should be h/ 2, let's consider what happens if 
we rotate a spin-down state |—z) about the z axis, that is, if we evaluate R(<pk) | —z). 
Just as before, we can argue that | -z) must be an eigenstate of J z . We can also argue 
that the eigenvalue for [—z> must be different from that for (+z). After all, if the 
eigenvalues were the same, applying the rotation operator R(<pk) to the state 


+*> 




-z) 


(2.38) 


would not rotate the state, since |+z) and |— z) would each pick up the same phase 
factor, and the state in (2.38) would itself pick up just an overall phase. Therefore, 
it would still be the same state. But if we rotate the state |+x) by an angle <p in the 
A-y plane, we expect the state to change. If we try 

J z [—z> = -fl-z) (2.39) 


for the eigenvalue equation for the spin-down state, we find 


R(ct> k)|—z> 



1 

2 ! 



2 

H- 


t-z) 


(2.40) 


6 You can start to see why we introduced a factor of 1/ft in the defining relation (2.29) between 
the infinitesimal rotation operator and the generator of rotations. 
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Using (2.37) and (2.40), we see that 

e ~m 2 J0/2 

R(4> k)|+x) = —— l+z) + —p-l-z) 

V2 V2 

= e -W,(_L | +l) + £j! M) ) (2,4.) 

which is clearly a different state from (2.38) for <p ^ 0. In particular, with the choice 
<p — n/2, we obtain 


R( §k)|+x) =<T i7r / 4 

= ,- /4 (^l+ z ) + ^|- Z ))=^ /4 l+y) (2.42) 

where we have replaced the term in the brackets by the state | +y) that we determined 
in (1.30). Since two states that differ only by an overall phase are the same state, 
we see that rotating the state |+x) by 90° counterclockwise about the z axis does 
generate the state |+y) when (2.35) holds. Thus we are led to a striking conclusion: 
When the operator that generates rotations about the z axis acts on the spin-up-along- 
z and spin-down-along-z states, it throws out a constant (the eigenvalue) times the 
state (the eigenstate); the eigenvalues for the two states are just the values of the z 
component of the intrinsic spin angular momentum that characterize these states. 

Finally, let us note something really perplexing about the effects of rotations on 
spin-^ particles: namely. 


\+z) + 


? in/2 

7f 


|-z) 


fi(27rk)|+z) = e W7r |+z) = -j+z> (2.43a) 


and 


«(27Tk)|+z> = e in \-z) = — | —z) (2.43b) 

Thus, if we rotate a spin-4 state by 360° and end up right where we started , we 
find that the state picks up an overall minus sign. Earlier we remarked that we could 
actually perform these rotations on our spin systems by inserting them in a magnetic 
field. When we come to time evolution in Chapter 4, we will see how this strange 
prediction (2.43) for spin-4 particles may be verified experimentally. 


EXAMPLE 2.2 Show that rotating the spin-up-along-* state |+x) by 180° 
about the z axis yields the spin-down-along-.r state. 
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SOLUTION 4 


J?(;rrk)|+x) = tf(jrk) ( —|+z> + —|—z) 
\v2 s/i 

e -iTc/2 e iit /2 

= cr l+z)+ 


= e ^' jT/2 


— |+z) + — |-z) 

V 2 >/2 


^ /2 [ 1 l +z) _ 1 l_ z) 


= e - inll \-y 


where in the last line we have used the phase convention for the state |— x) 
given in (2.15). 


2.3 The Identity and Projection Operators 

In general, the operator R(6n) changes a ket into a different ket by rotating it by 
an angle 0 around the axis specified by the unit vector n. Most operators tend to do 
something when they act on ket vectors, but it is convenient to introduce an operator 
that acts on a ket vector and does nothing: the identity operator. Surprisingly, we 
will see that this operator is a powerful operator that will be very useful to us. 

We have expressed the spin state | \j/) of a spin-| particle in the S, basis as 
|i/r) = |+z)(+z|i//> + |—z){—z|t/r). Wecan think ofthe rather strange-looking object 

l+z){+z| + I—z)<—z| (2.44) 

:'-*r 

as the identity operator. It is an operator because when it is applied to a ket, it yields 
another ket. Moreover, if we apply it to the ket |i/r), we obtain 

(|+z)(+z| + |-z)<-z|)|V0 = |+z)(+z|i/r> + |-z)(-z|t/r) - | if) (2.45) 

We earlier discussed a nice physical mechanism for inserting such an identity 
operator when we analyzed the effect of introducing a modified Stern-Gerlach 
device in Experiment 4 in Chapter 1. Here, since we are expressing an arbitrary 
state \\js) in terms of the amplitudes to be in the states |+z) and |—z), we use a 
modified SG device with its magnetic field gradient oriented along the z direction, 
as shown in Fig. 2.4a. The important point that we made in our discussion of the 
modified SG device was that because we do not make a measurement with such a 
device, the amplitudes to be in the states |+z) and |—z) combine together to yield 
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(a) 



(b) 



(c) 

Figure 2.4 fa) A modified Stem-Gerlach device serves as the identity 
operator, (b) Blocking the path that a spin-down particle follows produces the 
projection operator P + . (c) Blocking the path that a spin-up particle follows 
produces the projection operator P_. 


the same state exiting as entering the device, just as if the device were absent. Hence, 
it is indeed an identity operator. 

The identity operator (2.44) may be viewed as being composed of two operators 
called projection operators: 

P+ = |+z)(+z| (2.46a) 

and 

P- = I—z)<—z| (2.46b) 

They are called projection operators because 

P+\f) = \+z)(+z\^) (2.47a) 

projects out the component of the ket | i//} along |+z) and 
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I P_\^r) = \-z)(- z \f) (2.47b) 

projects out the component of the ket jr/r) along j— z). 7 That (2.44) is the identity 
operator may be expressed in terms of the projection operators as 

P+ + = 1 (2.48) 

This relation is often referred to as a completeness relation. Projecting onto the 
two vectors corresponding to spin up and spin down are the only possibilities for a 
spin-| particle. As (2.45) shows, (2.48) is equivalent to saying that an arbitrary state 
IV' ) can be expressed as a superposition of the two basis states |+z) and |—z>. 

Notice that if we apply the projection operator P + to the basis states j+z) and 
| —z), we obtain 


■P+l+z) — j+z)(+z|+z) — |+z) (2.49a) 


and 


P + |—z} = j+z)<+z|-z)=0 (2.49b) 

Thus |+z) is an eigenstate of the projection operator P + with eigenvalue 1, and 
|—z) is an eigenstate of the projection operator P + with eigenvalue 0. We can obtain 
a physical realization of the projection operator P + from the modified SG device 
by blocking the path that would be taken by a particle in the stale [—z), that is, by 
blocking the lower path, as shown in Fig. 2.4b. Each particle in the state |+z) entering 
the device exits the device. We can then say we have obtained the eigenvalue 1. Since 
none of the particles in the state |—z) that enters the device also exits the device, we 
can say we have obtained the eigenvalue 0 in this case. 

Similarly, we can create a physical realization of the projection operator P_ by 
blocking the upper path in the modified SG device, as shown in Fig. 2.4c. Then each 
particle in the state |— z) that enters-the device also exits the device: 

P z) = | —z) (—z| —z) = | —z) (2.50a) 

while none of the particles in the state |+z) exits the device: 

A. j+z) = |-z)(-z|+z) =0 (2.50b) 

Hence the eigenvalues of P are 1 and 0 for the states |—z) and |+z), respectively. 


7 Notice that the projection operator may be applied to a bra vector as well: 

(f\P+ = (f\+x)(+z\ (#|F = | 
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(b) 


Figure 2.5 Physical realizations of (a) P~ = P + and (b) P P + — 0. 

Notice that each of the particles that has traversed one of the projection devices 
is certain to pass through a subsequent projection device of the same type: 

Pi = (|+z)(+z|)(|+z)(+z|) 

= |+z)(+z|+z)(+z| = |+z){+z| = P + (2.51a) 

P 2 = (I z)( z|)(| z)( zj) 

= I z)( z| z)( z| = |—z)(—z| - P_ (2.51b) 

while a particle that passes a first projection device will surely fail to pass a subse¬ 
quent projection device of the opposite type: 


P +P- = (|+z)(+z|)(|-z)(-z|) 


= |+z}(+z|-z)<-z| =0 

(2.52a) 

/LP + = (|-z)(-z|)(|+z)<+z|) 


= |-z)(-z|+z)(+z| =0 

(2.52b) 

These results are illustrated in Fig. 2.5. 


Our discussion of the identity operator and the projection operators has arbitrarily 
been phrased in terms of the S z basis. We could as easily have expressed the same 

state |i //) in terms of the S x basis as \ f) = |+x)(+x|^) + j 
can also express the identity operator as 

—x)(—x|i/f). Thus we 

|+x)(+x| + | x)( x| = 1 

(2.53) 


and view it as being composed of projection operators onto the states |+x) and |— x). 

Let’s use this formalism to reexamine Experiment 4 of Chapter 1. in this exper¬ 
iment a particle in the state |+z) passes through a modified SGx device and then 
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enters an SGz device. Since the modified SGx device acts as an identity operator, 
the particle entering the last SGz device is still in the state |+z) and thus the ampli¬ 
tude to find the particle in the state |—z) vanishes: <—z|+z) = 0. There is, however, 
another way to express this amplitude. We use the identity operator (2.53) to express 
the initial ket in tenns of the amplitudes to be the states |+x) and |—x): 

|+z) = |+x)(+x|+z) + |-x)(-x|Tz) (2.54) 

Then we have 

(-Z|+z) = <-z|+x)(+x|+z) + ( z| x)( x|Tz) (2.55) 

Thus the amplitude for a particle with S z — h/2 to have S, — — h/2 has now been 
written as the sum of two amplitudes. We read each of these amplitudes from right 
to left. The first amplitude on the right-hand side is the amplitude for a particle with 
S z = h/2 to have S x = h/2 times the amplitude for a particle with S x = h/2 to have 
S, = —h/2. The second amplitude is the amplitude for a particle with S, = h/2 
to have S x = —h/2 times the amplitude for a particle with S x = —h/2 to have 
S z = —h/2. Notice that we multiply the individual amplitudes together and then add 
the resulting two amplitudes with the |+x) and |—x) intermediate states together to 
determine the total amplitude. 

We now calculate the probability: 

|{—z|+z)| 2 = [(—z|+x}| 2 |{+x|+z)| 2 + |{—z|—x)| 2 |(—x|+z)| 2 

+ (-z|+x){+x|+z)(-z|-x)*(-x|+z)* 

+ (-z|+x)*(+x|+z)*(-z|-x){-x]+z) (2.56) 

This looks like a pretty complicated way to calculate zero, but it is interesting to 
examine the significance of the four terms on the right-hand side. The first term is 
just the probability that a measurement of S x on the initial state yields h/2 times 
the probability that a measurement Wf S z on a state with S x = h/2 yields —h/2. The 
second term is the probability that a measurement of S x on the initial state yields 
-h/2 times the probability that a measurement of S z on a state with S x — -h/2 
yields —h/2. These two terms, which sum to are just the terms we would have 
expected if we had made a measurement of S x with the modified SGx device. But 
we did not make a measurement and actually distinguish which path the particle 
followed in the modified SGx device. 8 Thus there are two additional terms in (2.56), 
interference terms, that arise because we added the amplitudes on the right-hand 
side together before squaring to get the probability. You can verify that these two 


8 It should be emphasized that a measurement here means any physical interaction that would 
have permitted us in principle to distinguish which path is taken (such as arranging for the particle 
to leave a track in passing through the modified SG device), whether or not we actually choose to 
record this data. 
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No/4 

No/4 


No/4 
No/4 


(a) 



(b) 

Figure 2.6 Block diagrams of experiments with SG devices in which 
(a) a measurement of S x is carried out, illustrating |(— z| —x)| 2 |{—x|+z}| 2 + 
|{—z|+x)| 2 |{+x[+z)| 2 = 4; and (b) no measurement of S x is made, either by 
inserting a modified SGx device between the two SGz devices or by simply 
eliminating the SGx device pictured in (a), illustrating |{— z|—x){—x|+z) + 
{-z|+x)(+x|+z)| 2 = |(-z|+z)| 2 = 0. 


interference terms do cancel the first two probabilities. These results are summarized 
in Fig. 2.6. In more general terms, if you do not make a measurement, you add the 
amplitudes to be in the different (indistinguishable) intermediate states, whereas if 
you do make a measurement that would permit you to distinguish among these states, 
you add the probabilities. 

Finally, it is convenient to introduce the following shorthand notation. For a given 
two-dimensional basis, we can label our basis states by ]1) and |2). We can then 
express the identity operator as 

2>X*'I = 1 (2.57) 

I 

where the sum is from i = 1 to i = 2. The straightforward generalization of this 
relationship to larger dimensional bases will be very useful to us later. 

2.4 Matrix Representations of Operators 


In order to change, or transform, kets, operators are required. Although one can 
discuss concepts such as the adjoint operator abstractly in terms of its action on the 
bra vectors, it is helpful to construct matrix representations for operators, making 
concepts such as adjoint and Hermitian operators more concrete, as well as providing 
the framework for matrix mechanics. Equation (2.25) is a typical equation of the form 

A\f) = \<p) (2.58) 
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where A is an operltor and t/0 and \(p) are, in general, different kets. We can also 
think of the eigenvalue equation (2.35) as being of this form with | ip) just a constant 
times \\fr). Just as we can express a quantum spin state \xj/) using the S z basis states by 

\f) = |+z)(+z|i/r) + |—z)(—zj f) (2.59) 

we can write a comparable expression for \cp): 

\<p) = |+z}(+z|<p) + |-z)(-z|^) (2.60) 


Thus (2.58) becomes 

A (|+z>(+z|i/r) + I—z)(-z|yr» = |+z}{+z|</>) + |-z)(-z| <p) (2.61) 


In ordinary three-dimensional space, a vector equation such as F = m a is really 
the three equations: F x = ma x , F v = ma y , and F z = ma z . We can formally obtain 
these three equations by taking the dot product of the vector equation with the basis 
vectors i, j, and k; for example, i • F = i - ma yields F x — ma x . Similarly, we can 
think of (2.61) as two equations that we obtain by projecting (2.61) onto our two 
basis states, that is, by taking the inner product of this equation with the bras (+z| 
and (—z|: 


<+z|i|+z)(+z|i/r) + {+z|A|—z){—z|i/0 = (+z|^> (2.62a) 


and 

(—z| A |+z) (+z|i/r) + <—z|A|—z)<—z|i//> = <-z| (p) (2.62b) 


These two equations can be conveniently cast in matrix form: 

/ (+z|A|+z) <+z|A|—z)\ /<+z|r/r)\ / (+z|«p) \ ^ 

\ (—z|A|+z) {—z|A|—z) / \ (—z|V/) / \(-z|9) / 

In the same way that we can represent a ket |t/t) in the .S(. basis by the column vector 


\f) - 

S z basis 


{+zh k) \ 

( z| t/r) ) 


(2.64) 


we can also represent the operator A in the S z basis by the 2 x 2 matrix in (2.63). 
Just as for states, we indicate a representation of an operator with an arrow: 


A-> 

S z basis 


(+z|A|+z) 

{-z|A|+z> 


(+z|A|—z) 
(-z|A|-z) 



(2.65) 


If we label our basis vectors by 11) and |2) for the states | +z) and | —z), respectively, 
we can express the matrix elements A,- ; in the convenient form 


A iJ = (i\A\j) 


( 2 . 66 ) 
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where i labels the rows and j labels the columns of the matrix. Note that knowing 
the four matrix elements in (2.63) allows us to determine the action of the operator 
A on any state \tfr). 


MATRIX REPRESENTATIONS OF THE PROJECTION OPERATORS 

As an example, the matrix representation of the projection operator P + is given by 

/( +I |r + |«, < + *Ai-*)W> ox 

+ S z basis \(-z|P + |+Z> {-Z|P + |-Z>/ VO 0/ 

where we have taken advantage of (2.49) in evaluating the matrix elements. Similarly, 
the matrix representation of the projection operator P_ is given by 




(2.67b) 


Thus, the completeness relation P + + P_ — 1 in matrix form becomes 

1 °j + (° °W' °)=, 

oo/ V o i / V o i / 


( 2 . 68 ) 


where 1 is the identity matrix. The action of the projection operator P + on the basis 
states is given by 

(2.69a) 

(2.69b) 

in agreement with equations (2.49a) and (2.49b), respectively. 



MATRIX REPRESENTATION OF J, 

As another example, consider the operator the generator of rotations about the 
z axis. With the aid of (2.35), we can evaluate the matrix elements: 



S z basis 


n+z\j z \+z) (+z|i z |-z) \ 
V<-z|JJ+z) (-z|/J-z)/ 


/ (fi/2)(+z|+z) 
V (h/2)(-z\+z) 
h/2 0 \ 

0 -h/2) 


(-n/ 2)(+z|-z)\ 
(-H/ 2)(—z|—z)J 


(2.70) 
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The matrix is diagonal with the eigenvalues as the diagonal matrix elements be¬ 
cause we are using the eigenstates of the operator as a basis and these eigenstates 
are orthogonal to each other. The eigenvalue equations J z |+z) = (h/ 2)|+z) and 
J z |—z) = (— h/2)\— z) may be expressed in matrix mechanics as 


and 



(2.71) 



(2.72) 


respectively. Incidentally, we can write the matrix representation (2.70) in the form 

° °)-*(° °) ,2.73a, 

Szbask V o -n/2) 2 \ 0 0/ 2 Vo 1 / 

which indicates that 

h = \h~\P- = ^+*><+*1 - ||-z)<-z| (2.73b) 

We could have also obtained this result directly in terms of bra and ket vectors by 
applying J z to the identity operator (2.48). 


EXAMPLE 2.3 Obtain the matrix representation of the rotation operator 
R(<pk) in the S z basis. 


SOLUTION Since R(<pk) = e 'T^ and e lJ ^^ h \±x) == e Tl ^^ 2 \dnz) 

/ 0 

s z brfs V 0 e 


R( 4 > k) 


This matrix is diagonal because we are using the eigenstates of .7, as a basis. 


MATRIX ELEMENTS OF THE ADJOINT OPERATOR 

We next form the matrix representing the adjoint operator AV If an operator A acting 
on a ket |i (r) satisfies 


A\f) = | <p) 


(2.74) 


then, by definition. 


W A^ = {(p\ 


(2.75) 
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ly/> -—-*• A|i//> 



Figure 2.7 The adjoint operator of an operator A is 
defined by the correspondence between bras and kets. 


(See Fig. 2.7.) If we take the inner product of (2.74) with the bra (x |, we have 

(X\W) = (X\<P) (2.76) 

while taking the inner product of (2.75) with the ket |x), we obtain 

(^|A t lx) = Mx) (2.77) 

Since (xl^) = (^lx)*’ we see that 

W^\x) = (x\A\f)* (2.78) 

This straightforward but important result follows directly from our definition (2.75) 
of the adjoint operator. It can be used to tell us how the matrix representations of an 
operator and its adjoint are related. If we replace |y!r) and |x) with basis states such 
as |+z) and |—z), we obtain 


(ilA^j) = (j\A\>)* 


(2.79) 


We denote this as 


a;,. 



(2.80) 


which tells us that the matrix representing the operator A + is the transpose conjugate 
of the matrix representing A. We can define the adjoint matrix A 1 as the transpose 
conjugate of the matrix A. 

We also find another important result. Since by definition a Hermitian operator 
A satisfies A = A', then (i\A\j ) = (j\A\i)*, showing that the matrix representation 
of a Hermitian operator equals its transpose conjugate matrix. Our terminology for 
adjoint and Hermitian operators is consistent with the terminology used in linear 
algebra for their matrix representations. We can now see from the explicit matrix 
representations of the operators P + in (2.67) and J z in (2.70) that these are Hermitian 
operators, since the matrices are diagonal with real elements (the eigenvalues) on the 
diagonal. In Chapter 3 we will see examples of Hermitian operators with off-diagonal 
elements when we examine the matrix representations for J x and J v for spin-^ and 
spin-1 particles. 
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Figure 2.8 The adjoint of the product of operators is 
determined by the correspondence between bras and kcts. 


THE PRODUCT OF OPERATORS 

We often must deal with situations where we have a product of operators, such as 
(2.51), which involves the product of two projection operators. Another way such a 
product of operators might arise is to perform two successive rotations on a state. To 
obtain the matrix representation of the product AB of two operators, we first form 
the matrix element 

{i\AB\j) 

If we insert the identity operator (2.57), we obtain 

(i\AB\j) = (i\A [£|ifc)(*|) B\j) = '%2(i\A\k){k\B\j) = '£ i A ik B k j (2 ' 81 ) 
\k / k k 

which is the usual rule for the multiplication of the matrices representing A and B. 

What is the adjoint operator for the product AB of two operators? As Fig. 2.8 
shows, 

{AB) t = B t A t (2-82) 

5" 


EXAMPLE 2.4 Use matrix mechanics to show that P* — P + , P 2 _ = P_, 
and P+P_ = 0. 

SOLUTION 
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2.5 Changing Representations 


The rotation operator R' can be used to rotate a ket h/0 into a new ket | i//) in an 
active transformation: 


W) = R'\f) (2.83) 

Recall that the rotation operator R 1 is just the inverse of the rotation operator R. so 
if R rotates the state counterclockwise about the axis n by some angle (9, then R' 
rotates the state clockwise about the axis n by the same angle 6 : 

R 1 (6n) = R(-da) (2.84) 

We can form a representation for the ket \f r ) in the S z basis, for example, in the 
usual way: 

,*-> — = (2 . 85) 

S. basis \ (-Z;l// ) / \{-z\R'\f) J 

There is, however, another way to view this transformation. Instead of the operator 
R ^ acting to the right on the ket, we can consider it as acting to the left on the bras. 
From our earlier discussion of the adjoint operator, we know that kets corresponding 
to the bras (±z|R^ are R |±z). Since R is the inverse of the operator R', we see that 
instead of R' rotating the state ft/r) intoanew state \ f) asin(2.83), we may consider 
the operator R' in (2.85) to be performing the inverse rotation on the basis states that 
are used to form the representation. 

Let's take some specific examples to illustrate. In Problem 3.5 it is shown that 

|+x) = R(fj)|+z) (2.86) 


where 


l+x) = 


1 

71 


i+z) + 


1 


-z) 


From (2.42) we see that 


(2.87) 


R(^k)\+x) = c - in/4 

which as we noted differs from the state we have defined as |+y) by the overall phase 
factor of An alternative would be to define |+y) = R(fk)|;+x) including this 

phase factor. Similarly, we would define the state | —x) as one that is obtained by 


l+z) + 


s/2 


-z) 


( 2 . 88 ) 


I—X) = R(fj)|—z) 


(2.89) 
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that is by a rotation of the state |—z) by 90° around the y axis. Following this 
procedure, as Problem 3.5 shows, we find that 


|-x) = —^|+z) + —t= I —z) 

s/2 V2 


(2.90) 


which differs from (2.15) by an overall minus sign. 

We will use the states j+x) and | —x) shown in (2.87) and (2.90) for the remainder 
of this section since it is convenient to focus our discussion on basis states that 
are related to the states by |+z) and |— z) by application of a rotation operator, 
specifically 


|±x) = R(§j}|±z> (2.91a) 

and therefore 

<±x| = (±z|tf + (fj) (2.91b) 

If we take the operator R r in (2.85) to be the specific rotation operator A* '( ^ j), then 
when this operator acts to the left on the bra vectors it transforms the S z basis to the 
$ x basis according to (2.91b). But if /c (f j) acts to the right, it generates a new state 

= (2.92) 


We can summarize our discussion in the following equation: 

, _^ \ / (+ZiA f (fj)|l/r) \ / (+X|l//) 

Sz basis \(- Z |t/r')/ V ) V ( Xh//) 

(2.93) 

Read from the left, this equation gives the representation in the S z basis of the state 
it jj’) that has been rotated by 90° clockwise around the y axis, whereas read from 
the right, it shows the state |t/0 aslfeeing unaffected but the basis vectors being ro¬ 
tated in the opposite direction, by 90° counterclockwise around the y axis. Both of 
these transformations lead to the same amplitudes, which we have combined into 
the column vector in (2.93). This alternative of rotating the basis states used to form 
a representation is often referred to as a passive transformation to distinguish it 
from an active transformation in which the state itself is rotated. A passive trans¬ 
formation is really just a rotation of our coordinate axes in our quantum mechanical 
vector space, as illustrated in Fig. 2.9. 9 



9 If (2.43) did not seem sufficiently strange to you. try considering it from the perspective 
of a passive transformation. If we rotate our coordinate axes by 360° and end up with the same 
configuration of coordinate axes that we had originally, we find the state of a spin-j- particle has 
turned into the negative of itself. 
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-V v 




Figure 2.9 (a) Rotating a state by angle 4> counter¬ 
clockwise about an axis is equivalent to (b) rotating 
the coordinate axes by the same angle in the opposite 
direction, keeping the state fixed. 


Equation (2.93) suggests a way to relate the column vector representing the ket 
I f) in one basis to the column vector representing the same ket in another basis. If 
we start with the representation of the ket | f) in the S x basis and insert the identity 
operator, expressed in terms of S z basis states, between the bra and the ket vectors, 
we obtain 


/ (+x| l/r) \ _ / (+x|+z) (+x|—z) \ / (+z\f) \ 

V(-x| f)J \ ( X| + Z) (—x[—Z) / \ {—z|^r) / 

= / (+Z|i t (|j)|+Z) (+z|^t(|j)|_ 2 ) \ / (+z| f) \ 

V {-Z|^ + (f j)l+z) (-z|/? + (fj)|-z) / V (—z|l/r) j U94) 


where the second line follows from (2.91b). We call the 2 x 2 matrix in (2.94) 
S f , or more precisely in this specific example S f (|j), since it is really the matrix 
representation in the S z basis of the operator R ’( j j) that rotates kets by 90° clockwise 
about the y axis. Equation (2.94) transforms a given ket \ f) in the S z basis into the 
S x basis. 

We can transform from the S x basis to the S z basis in analogous fashion: 


/ (+z| ifr) \ _ / (+z|+x) (+z|-x) \ / (+x|l/r) \ 

V (—z| l/r) 7 V (-Z)+X) (-zl-x) J \ <-X|^) / 

= / (+Z|^(fj)|+z) {+z|J?(f j)|-z) \ / (+x|f) \ 7 

V (-Z|i?(§j)!+Z> (-z(i(fj)l-z) ) V {—Xl^r) ) U ' 95) 


where in the first line we have inserted the identity operator, this time expressed in 
terms of the S x basis states. Also we have used (2.91 a) to express the 2x2 matrix in 
the second line of the equation in terms of the matrix representation of the operator 
Comparing the first lines of (2.94) and (2.95) reveals that the 2 x 2 matrix in 
(2.95) is the matrix S, the adjoint matrix of the matrix S t , since the matrix elements 
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of S are simply obtained from the matrix elements of S' by taking the transpose 
conjugate. Also, a comparison of the second lines of (2.94) and (2.95) shows that 
the 2 x 2 matrix in (2.95) is the matrix representation of AN 4 ,j). while the 2 x 2 
matrix in (2.94) is the matrix representation of 7 c(fj). Since the rotation operators 
are unitary, the matrices must satisfy 


S f S = I 


(2.96) 


which can also be verified by substituting equation (2.95) directly into equation 
(2.94). 

We can now determine how the matrix representation of an operator in one basis 
is related to the matrix representation in some other basis. For example, the matrix 
representing an operator A in the S x basis is given by 

2 /(+x|A|+x) 

S x basis y (— X ! A|+x) 

A typical matrix element can be expressed as 


<+x|A|-x) \ 
( _ xj A|—x) ) 


(+x|A |—x) = (+z|A + (§j)Afl(§j)[-z) 


Inserting the identity operator (2.44) before and after the operalor A on the left-hand 
side or between each of the operators on the right-hand side [or using result (2.81) 
for the matrix representation of the product of operators] permits us to write 

A --> S + AS (2.98) 

S x basis 


where A is the matrix representation of A in the S z basis. 10 

Let’s take the example of evaluating the matrix representation of ./. in the S x 
basis. Using (2.87) and (2.90) to evaluate the matrix § in (2.95), we find 


(+z|+x) (+z|-x) 



(-z|+x) ( z| x) 


— ( 1 
■s/2 V 1 



(2.99) 


10 The first lines of (2.94) and (2.95) fonn a good advertisement for the power of the identity 
operator. Rather than trying to remember such equations, it is probably easier and safer to derive 
them whenever needed by starting with the matrix elements (or amplitudes) that you are trying to 
find and inserting the identity operator from the appropriate basis set in the appropriate place(s). 
In this way we can work out the matrices in (2.98): 


/ (+x|A|+x) |+x|A|-x) \ 

V (-x|A]+x) <-x|A!—x)/ 


/ (+x|+z) (+xj-z) \ / (+z|A|+z) (+z!A|-z> \ / (+z| +x> (+zj-x)\ 
V (-xl+z) ( x[ z) / V (~z|Aj+.z) (—z|A|—z> / V (~z|+x) (-z)-x) / 
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Carrying out the matrix multiplication (2.98) using the matrix representation of J z 
in the S z basis from (2.70), we obtain 

j _^ ±( 1 '\h/l 0 \ j_/l 

z S x basis > V2 V -1 1 / 2 \ 0 -17V2V1 

( 2 . 100 ) 

Comparing (2.100) with (2.70), we see that the matrix representation of the operator 
is no longer diagonal, since we are not using the eigenstates of the operator as the 
basis. 11 

If we also take advantage of (2.94) to express the eigenstate |+z) in the S x basis, 
l+z)- > —=( ! = (2.101) 

S x basis ^2 \ — 1 1 / V 0 / \/2 V — 1 / 



we can express the eigenvalue equation J, | +z) = (hj 2)|+z) in the S x basis: 


n/o -i\ j_ 

2 \ —1 0 ) s /2 



( 2 . 102 ) 


Compare (2.102) with (2.71), where the same equation is written in the S z basis. 
Note that the eigenvalue equation is satisfied independently of the basis in which 
we choose to express it. This eigenvalue equation in its most basic form deals with 
operators and states, not with their representations, which we are free to choose in 
any way we want. 

Before leaving this section, it is worth emphasizing again what we have learned. 
The S-matrices give us an easy way to transform both our states and our operators 
from one matrix representation to another. As the first line in both equations (2.94) 
and (2.95) shows, these S-matrices are composed of the ampl itudes formed by taking 
the inner product of the basis kets of the representation we are transforming/ram with 
the basis bras of the representation we are transforming to. It is often convenient, 
however, to return to the active viewpoint with which we started our discussion. 
Instead of the S-matrices transforming a given state from one basis to another, we 
can view the S-matrix as the matrix representation of the rotation operator that rotates 
the given state into a different state within a fixed representation. This will be our 
starting point in Chapter 3. As we have seen, an active rotation that transforms the 


11 Alternatively, we could evaluate the matrix representation of /, in the S x basis by expressing 
the basis states |±x) in terms of |±z) so that we can let./, act on them directly. For example, the 
element in the first row, second column of (2.1 (X)) is given by 


(+x|./ z |-x) = - (<+z| + {- 


"z|) J z (-I+Z) + j-z)) 


2 

2 


«+z|+(-z|) 


( ft, , ft, 

-H"Z) — - 

v 2 2 


ft. 

2 
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state is just the inverse of the passive rotation that transforms the basis vectors used 
to form a particular representation. 


EXAMPLE 2.5 The first lines of (2.94) and (2.95) as well as equation (2.98) 
and its inverse can be used to switch back and forth between the S z and S x 
bases for basis states such as 


l+x) = Ti l+Z) + ^ , - x) = ^ l+z> ~7! l ~ z) 


even though in this case the S-matrix 


g _/<+z|+x) (+z|—x)\ 

V <-z|+x) (-z|-x) / 

is not the matrix representation of the rotation operator. Determine S for these 
basis states and use it to repeat the calculations given in (2.100), (2.101), and 
(2.102). 

SOLUTION 

s _ / (+z|+x> (+z|-x) \ 1 / 1 1 \ 

V (—z|+x) ( z| x) / y/l V 1 -1/ 

Thus in the S x basis 


/ <+x|+z} (Tx | z) \ / <+z|/,|+z) (Tz| J z | z) \ 

V (—x|+z) ( x| z) / \ ( z|T z j-f-z) (—z|7 z |—z) / 

(+z|+x) (+z|-x) \ 

(-Z|+x) (~z|-x>J 

_J_/i l\H(l 0\_1_/I l\_fi/0 1\ 

_ Vl V 1 -17 2 U -l/v^Vl -1/ 2 v 1 0/ 

The state |+z) can be transformed into the S x basis by the matrix S 1 ', which 
in this case is equal to the matrix S: 

-i) (o) = 7! ( i) 

Thus the eigenvalue equation 7 z |+z) = (H/2)\+x) in the S x basis becomes 
h/o i\ j_ /i \ _ h_ j_ /1 

2 \ 1 Oj V2 Vl7 2^2 Vl 

As before, we see that the eigenvalue equation is satisfied with the same 
eigenvalue in either basis. 
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2.6 Expectation Values 


It is interesting to see how we can use matrix mechanics to calculate expectation 
values of observables like the z component of the angular momentum with which 
we have associated the operator J z . If a spin-| particle is in the state 


W = \+z){+z\f) + |-z>(-z|i/r> (2.103) 


then, as we saw in Section 1.4, the expectation value of S, is given by 

<^> = (0 |( + z|^)| 2 +(-0 |(-z|V /)| 2 (2.104) 

That is, the expectation value of S, is the sum of the results /?./2 and — h/2 of a 
measurement multiplied by the probability | (+z| t/r) | 2 and | (~z| xfr) | 2 , respectively, of 
obtaining each result. We can express this expectation value in matrix mechanics as 

h ( 1 0 \ /<+z|Vr>\ 

(S z ) = «f\+z), M-z)) - f , ) (2.105) 

2 VO -1 / V {-Z\f) ) 

as can be verified by explicitly carrying out the matrix multiplication. The right-hand 
side of (2.105) is the representation in the S z basis of (\j/\ J z \\jr). Thus, we can also 
express the expectation value in the form 

(S 2 ) = (f\J z \ir) (2.106) 


In the language of eigenstates and eigenvalues, the expectation value (2.104) is 
the sum of the eigenvalues with each weighted by the probability of obtaining that 
eigenvalue. The advantage of expressing the expectation value in the form (2.106) 
is that we needn’t evaluate it in a representation in which the basis states are the 
eigenstates of the operator in question. For example, we could evaluate (2.106) in 
the S x basis by inserting the identity operator (2.53) between the bra vector and the 
operator and between the operator and the ket vector. Then we have 


<Sz> = (WI+x>< W-x» 


(+x|7 2 |+x) (+x|J,|—x) 

(-x|/ z |+x) (-x|7 z |-x) 


<+X|^) \ 
(~x|V/> ) 


(2.107) 


You can verify that we can also go from (2.105) in the S z basis to (2.107) in the 
S x basis by inserting the identity operator §§ + before and after the 2x2 matrix 
in (2.105), provided we use the S-matrix (2.99) that transforms between these two 
basis sets. 

As an example, let’s return to (1.20), where we evaluated the expectation value 
of .S', for the state |+x). Substituting the column vector representation (2.6) for this 
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ket in the S, basis .into (2.105), we see that the expectation value may be written in 
matrix form as 


<s z > = 4=a> d- ( 1 °)4=( 1 > = ° 

V2 2 VO -\) yfl V 1 , 


(2.108) 


EXAMPLE 2.6 Use matrix mechanics to evaluate the expectation value 
(S z ) for the state |+x) in the S x basis states 

l+x) = ^ l+z) + Jf Hz) '~ x) = 7! l+z> " 2! Hz> 

SOLUTION In Example 2.5 we saw that in this basis 

h( 0 1 




1 0 


then for the state |+x) 


i 


: 0 


This result agrees of course with (2.108). In (2.108) the matrix form for the 
operator is especially straightforward, while here it is the representation for 
the state that is especially simple. 


EXAMPLE 2.7 Use matrix mechanics to determine (S z ) for the state 

1 i s/3 

\f) = -l+z) + — |-z) 

Compare your result with thatfef Example 1.2. 

SOLUTION 

1 r % ( 1 h 

W = _ 1 y2\iV3/ = -f 

in agreement with Example 1.2. 


2.7 Photon Polarization and the Spin of the Photon 


The previous discussion about representations of states and operators may seem 
somewhat mathematical in nature. The usefulness of this type of mathematics is just 
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,:x‘ 

-x 


Figure 2.10 Two sets of transmission axes of a polarizer that 
may be used to create polarization states of photons traveling 
in the z direction. 


a reflection of the fundamental underlying linear-vector-space structure of quantum 
mechanics. We conclude this chapter by looking at how we can apply this formalism 
to another physical two-state system, the polarization of the electromagnetic field. 
Many polarization effects can be described by classical physics, unlike the physics 
of spin-i particles, which is a purely quantum phenomenon. Nonetheless, analyzing 
polarization effects using quantum mechanics can help to illuminate the differences 
between classical and quantum physics and at the same time tell us something 
fundamental about the quantum nature of the electromagnetic field. 

Instead of a beam of spin-j atoms passing through a Stem-Gerlach device, 
we consider a beam of photons, traveling in the z direction, passing through a 
linear polarizer. Those photons that pass through a polarizer with its transmission 
axis horizontal, that is, along the x axis, are said to be in the state |x), and those 
photons that pass through a polarizer with its transmission axis vertical are said 
to be in the state |y). 12 These two polarization states form a basis and the basis 
states satisfy (x|y) = 0, since a beam of photons that passes through a polarizer 
whose transmission axis is vertical will be completely absorbed by a polarizer whose 
transmission axis is horizontal. Thus none of the photons will be found to be in the 
state \x) if they are put into the state |y) by virtue of having passed through the initial 
polarizer (assuming that our polarizers function with 100 percent efficiency). 

We can also create polarized photons by sending the beam through a polarizer 
whose transmission axis is aligned at some angle to our original x-y axes. If the 
transmission axis is along the x' axis or y' axis shown in Fig. 2.10, the corresponding 
polarization states may be written as a superposition of the |x) and |y) polarization 
states as 


|x') = |x)(x|x / ) + |y)(y|x') 

\y') = \x)(x\y l ) + \y){y\y l ) (2.109) 

What are the amplitudes such as (x|x'), the amplitude for a photon linearly 
polarized along the .r' axis to be found with its polarization along the x axis? 


12 These states are often referred to as |.t) and |y). A different typeface is used to help 
distinguish these polarization states from position states, which will be introduced in Chapter 6. 
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Figure 2.11 An x' polarizer followed by an 
x polarizer. 


A classical physicist asked to determine the intensity of light passing through a 
polarizer with its transmission axis along either the x or the y axis after it has passed 
through a polarizer with its transmission axis along xf, as pictured in Fig. 2.11, 
would calculate the component of the electric field along the x or the y axis and 
would square the amplitude of the field to determine the intensity passing through 
the second polarizer. If we denote the electric field after passage through the initial 
polarizer by then the components of the field along the x and y axes are given by 

E x = , ■ cos <p E v = E x > sin <j> 

Thus the intensity of the light after passing through the second polarizer with 
its transmission axis along the x or y axis is proportional to cos 2 cp or sin 2 cp, 
respectively. We can duplicate the classical results if we choose (x\x'} — cos cp and 
(y \x') = sin 4>. Similarly, if the first polarizer has its transmission axis along the 
y' axis and we denote the electric field after passage through this polarizer by Ey t 
then the components of the field along the ,r and v axes are given by 

E x = —£ v / sin cp E v = E y > cos cp 

Again, we can duplicate the classical results if we choose (x| y') = — sin cp and 
(y \y') = cos (p . Of course, the experiments outlined here alone do not give us any 
information about the phases of the amplitudes. However, since classical electromag¬ 
netic theory can account for interference phenomena such as the Young double-slit 
experiment, it is perhaps not too surprising that our conjectures about the amplitudes 
based on classical physics yield a valid quantum mechanical set, including phases: 

x') = cos cp\x) + singly} 

\y') = — sin <p\x) E cos <p\y) (2.110) 

Where do the quantum effects show up? Classical physics cannot account for 
the granular nature of the measurements, that a photomultiplier can detect photons 
coming in single lumps. Nor can it account for the inherently probabilistic nature of 
the measurements; we cannot do more than give a probability that a single photon 
in the state |x') will pass through a polarizer with its transmission axis along .r. For 
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example, if the angle (p = 60°, then a single photon after having passed through 
an x' polarizer has a probability of |{x|x'}| 2 = cos 2 60° = 0.25 of passing through a 
second x polarizer. Knowing the polarization state of the photon does not, in general, 
determine whether it will pass through a subsequent polarizer. All we can determine 
is the probability, much to the discomfiture of the classical physicist who would like 
to believe that such results should be completely determined if enough information 
is known about the state of the system. The classical and quantum predictions are, 
however, in complete accord when the intensity of the beams is high so that the 
number of photons is large. 

We can use (2.110) to calculate the matrix S f that transforms from the \x)-\y) 
basis to the \x')-\y') basis: 


/ (x'\x) (x'|y) \ _ / cos cp sin cp\ 
\ (y'\x) (y'|y) / \ —sin0 cos (p ) 


The matrix § that transforms from the |x')-|y') basis to the |x)-|y) basis is given by 

/<*'*') {x\y') \ / COS0 sin 0 \ 

V (y\x ! ) (y|y')/ Vsin^ cos <p ) 


You can check that these matrices satisfy S^S = I. All the elements of the matrix 
§ are real. In fact, it is an example of an orthogonal matrix familiar from classical 
physics for rotating a vector in the x-y plane counterclockwise about the 2 axis by 
an angle <p. We can express § in terms of the rotation operator R((pk) that rotates the 
ket vectors themselves in this direction (|x'> = R((pk)\x) and | y’) — R((pk)\y)): 

/ {x\R(4> k)|x) (x|^(^k)|y>\ /cos cp -sin <p\ 

S = I „ „ ) = ( (2.113) 

V{y|/?(0k)|x) {y\R{(pk.)\y) / \ sin (p cos <p ) 


There is another set of basis vectors that have a great deal of physical significance 
but cannot be obtained from the |x)-|y) basis by a simple rotation. We introduce 


I R) = 


1 

n 


(I*) + «|y» 


(2.114a) 


I L) = 


1 

7 ! 


(k> - i\y)) 


(2.114b) 


These states are referred to as right-circularly polarized and left-circularly polarized, 
respectively. 

First, let’s ask what the classical physicist would make of a right-circularly 
polarized electromagnetic plane wave of amplitude E 0 traveling in the z direction. 


E = E {) ie i<kz - 0Jt) + i E 0 (2.115a) 
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Of course, the classical physicist uses complex numbers only as a convenient way 
to express a wave. The physics is determined by the real part of (2.115a), or 

E = £ 0 i cos(Az — cot) — E 0 j sin(Az — cot) (2.115b) 

The “extra” factor of i in the y component of E in (2.115a) here means that the x 
and v components of the electric field are 90° out of phase, as (2.115b) shows. If 
we take z = 0 and examine the time dependence of the electromagnetic field, we see 
an E field that rotates in a circle as time progresses. If you curl your right hand in 
the direction of the changing E, your thumb points in the direction of propagation 
along the positive z axis. The E field of the left-circularly polarized electromagnetic 
plane wave rotates in the opposite direction and thus would require you to curl your 
left hand in the direction of changing E to have your thumb point in the direction of 
propagation. 

We can produce circularly polarized light by allowing linearly polarized light 
to fall on a birefringent crystal such as calcite that is cut so that the optic axis 
of the crystal lies in the x-y plane. Light polarized parallel to the optic axis in a 
birefringent crystal has a different index of refraction than does light perpendicular 
to the optic axis. We can orient our coordinate axes so that the optic axis is along 
x and the perpendicular axis is, of course, along y. Denoting the different indices 
of refraction by n x and n v , we see from (2.115a) that light polarized parallel to the 
x axis will pick up a phase ( n x co/c)z in traversing a distance z through the crystal. 
Similarly, light polarized parallel to the y axis will gain a phase (, n y a>/c)z . Thus 
a beam of linearly polarized light incident on such a crystal with its polarization 
axis inclined at 45° to the ,i axis will have equal magnitudes for the x and y 
components of the electric field, as indicated in Fig. 2.12, and there will be a phase 
difference \(n x — n v )co/c]z between these two components that grows as the light 
passes through a distance z in the crystal. The crystal can be cut to a particular 
thickness, called a quarter-wave plate, so that the phase difference is 90° when the 
light of a particular wavelength exit*? the crystal, thus producing circularly polarized 
light. 

What does the quantum physicist make of these circular polarization states 
(2.114)? Following the formalism of Section 2.2, it is instructive to ask how these 
states change under a rotation about the z axis. If we consider a right-circularly 



Figure 2.12 Plane-polarized light incident on a quarter-wave 
plate with its direction of polarization oriented at 45° to the 
optic axis will produce circularly polarized light. 
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polarized state that has been rotated by an angle <p counterclockwise about the z axis, 
we see that it can be expressed as 


\R') = -—(\x') + i\y')) 


= -n=[cos 4>\x) + sin (p\y) + /(— sin (j>\x) + cos 4>\y))\ 

v 2 


(cos (f> — i sin </>) 

—Ji 

e~ i4> \ R) 


(I*) +i\y)) 


(2.116) 


Thus this state picks up only an overall phase factor when the state is rotated about 
the z axis. Based on our experience with the behavior of spin-j states under rotations, 
(2.116) indicates that the state is one with definite angular momentum in the z 
direction. Since (2.32) shows that 


\#) = R(<pk)\R)^e~ iJ ^ h \R) (2.117) 


consistency with the preceding equation requires that 

J z \R) = h\R) (2.118) 

Similarly, if we rotate the left-circularly polarized state by angle 4> counterclockwise 
about the z axis, we obtain 


telling us that 13 


\L!) =e i<t> \L) 


( 2 . 119 ) 


J z \L) = -h\L) (2.120) 

Thus the right-circularly and left-circularly polarized states are eigenstates of J z , the 
operator that generates rotations about the z axis, but with eigenvalues ±ft, not the 
±H/2 characteristic of a spin-| particle. In Chapter 3 we will see that the eigenvalues 
ol J, tor a spin-1 particle are -j-fi, 0, and —h. Photons have intrinsic spin of 1 instead 
of j. The absence of the 0 eigenvalue for J z for a photon turns out to be a special 
characteristic of a massless particle, which moves at speed c. 


L A particle with a positive (negative) projection of the intrinsic angular momentum along the 
direction of motion is said to have positive (negative) helicity. Photons thus come in two types, 
with both positive and negative helicity, corresponding to right- and left-circularly polarized light, 
respectively. 
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EXAMPLE 2.8 Determine the matrix representation of the angular mo¬ 
mentum operator J z using both the circular polarization vectors | R) and \L) 
and the linear polarization vectors x) and |y) as a basis. 

SOLUTION Let’s shirt with the easy one first. Since the states \R) and | L) 
are eigenstates of J, with eigenvalues h and —h, respectively 




)/?)-|Z-> basis 


(R\J Z \R) 

(L\l\R) 


{R\J Z \L) 
( L\l\L) 


h o 

0 -h 


The matrix is diagonal in this basis with the eigenvalues of the basis states 
on the diagonal. Switching to the linear polarization states |x) and | y) : 


4 


M-ly> 


^ / (x\R) (x\L) \ 

^ V {y\R} { y\L) ) 




1 1 


(R\J Z \R) 
(L\J Z \R) 
h 0 \ 


(R\J Z \L) 

(L\l\L) 


1 


0 -h)y/2\\ 


— l 

i 


(R\x) (R\y) \ 

(L\x) (L\y) ) 

0 -i 

i 0 


h 


In this basis, the matrix has only off-diagonal elements. Since a Hermitian 
matrix is equal to its transpose, complex conjugate, both of these represen¬ 
tations for J z satisfy this condition, as they must. 


2.8 Summary 


In this chapter we have introduced operators in order to change a state into a different 
state. Since we are dealing here primarily with states of angular momentum, the 
natural operation is to rotate these states so that a state in which a component of the 
angular momentum has a definite value in a particular direction is rotated into a state 
in which the angular momentum h@s the same value in a different direction. 14 The 
operator that rotates states counterclockwise by angle <fi about the z axis is 

R(4>k) = e~ iJ ^ /n (2.121) 

where the operator J z is called the generator of rotations about the z axis. In general, 
for an arbitrary operator A, the bra corresponding to the ket 

A\f) = W) (2.122a) 

is 

(iA|A f =:(<p| (2.122b) 


14 This way of describing a rotation of an angular momentum state may seem somewhat 
awkward, but in Chapter 3 we will see why we cannot say that the angular momentum simply 
points in a particular direction. 
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where the dagger denotes the adjoint operator. Thus the rotated bra corresponding 
to the rotated ket 

R(<P k)\f) = e- iS ^ /h \f) (2.123a) 

is given by 

(x//\RH<P^) = (f\e iJ ^ /h (2.123b) 

In order for probability to be conserved under rotation. 

{if\R f (<pk)R(4>k)\^) = (f\e iP ^ /fi e- iS ^ lh \f) = (f\ t//) (2.124) 

which requires that the generators of rotation be Hermitian: 

J} = J Z (2.125) 

An operator like the rotation operator that satisfies R'R = 1 is called a unitary 
operator. 

Fora spin-4 particle, the spin-up-along-z state |+z) and spin-down-along-z state 
|-z) satisfy 

l\± z) = ±-|±z) (2.126) 

showing that when the generator of rotations about the z axis acts on these states, 
the result is just the state itself multiplied by the value of S, that these states are 
observed to have when a measurement of the intrinsic spin angular momentum in 
the z direction is carried out. Thus we can use a terminology in which we label the 
states |±z) by \S Z = ±.h/ 2}, that is, we label the states by their values of S : . Similarly, 
for example, 

j x \±x) = ± f j\±x) (2.127) 

where J x is the generator of rotations about the x axis. In Chapter 3 we will argue 
on more general grounds that we should identify the generator of rotations with the 
component of the angular momentum along the axis about which the rotation is 
taking place. In subsequent chapters we will see that the operator that generates 
displacements in space is the linear momentum operator and the operator that 
generates time translations (moves the state forward in time) is the energy operator. 
Thus we will see repeated a pattern in which a Hermitian operator A is associated 
with a physical observable and the result a n of a measurement for a particular state 
| a n ) satisfies 


A\a„) 


(2.128) 
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Note that for a Hermitian, or self-adjoint, operator (A = A"*’), the bra equation 
corresponding to (2.128) is 


(a n \A = (a n \a* (2.129) 

An equation in which an operator acting on a state yields a constant times the state 
is called an eigenvalue equation. In this case, the constant a n in (2. 128) is called the 
eigenvalue and the state \a n ) [or (a„ \ in (2.129)] is called the eigenstate. 

We will now show that the eigenvalues of a Hermitian operator are real. Taking 
the inner product of the eigenvalue equation (2.128) with the bra (a k \, we obtain 

(a k \A\a n ) = a n (a k \a n ) (2.130) 

Taking advantage of (2.129), this equation becomes 

a* k (a k \a n ) =a n {a k \a n ) (2.131a) 

or 

(ot-dnHOk |fl B >=0 (2.131b) 

Note that if we take k — n, we find 

(a* - a n ){a n \a n ) =0 (2.132) 

and therefore the eigenvalues of a Hermitian operator are real (a* = a n ), a necessary 
condition if these are to be the values that we obtain for a measurement. Moreover, 
(2.131b) shows that 


(a k \a n ) = 0 a k ^a n (2.133) 

as we argued in Chapter 1 must be tjrue based on the fact that (a k \a n ) is the amplitude 
to obtain a k for a particle in the state | a n ). This shows that the eigenstates of a 
Hermitian operator corresponding to distinct eigenvalues are orthogonal. Thus our 
association of Hermitian operators with observables such as angular momentum 
forms a nice, self-consistent physical picture. 

We also see that we can express the expectation value (A) of the observable A in 
terms of the operator A as 

(A) — (Tjf\A\ijf) (2.134) 

For simplicity, let’s consider the case where there are two eigenstates |a x ) and | a 2 ) 
with a t ^4 a 2 , as is the case for spin Since a general state can be written as 

\f) = c x \a x ) + c 2 \a 2 ) (2.135) 
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then 

W\Mf) = (c*{ail + C 2 (a 2 \)A(c l \a ] ) + c 2 \a 2 )) 

= (f*(ai| +4{a 2 |)(c 1 a 1 |fl|> + c 2 a 2 \a 2 )) 

— i c ll“ a l + \Cl\~ a 2 

= <A) (2.136) 

where the last step follows since the penultimate line of (2.136) is just the sum of the 
eigenvalues weighted by the probability of obtaining each of those values, which is 
just what we mean by the expectation value. 

Also note that, as in (1.40), (2.135) can be expressed in the form 

W) = \a l )(a l \i,) + \a 2 )(a 2 \f) (2.137) 

This suggests that we can write the identity operator in the form 

\ a i)( a \\ + \o-2)( a 2\ — 1 (2.138) 

which is also known as a completeness relation, because it is equivalent to saying 
that we can express an arbitrary state \ f) as a superposition of the states kq) and 
\a 2 ), as shown in (2.137). The identity operator can be decomposed into projection 
operators 

P, = |ai)(a,| and P 2 =\a 2 )(a 2 \ (2.139) 

that project out of the state | ifr) the component of the vector in the direction of the 
eigenvector. For example, 

P\W) = t«i}(aj^> (2.140) 

If we insert the identity operator (2.138) between the ket and the bra in the 
amplitude {(p\ijr), we obtain 

(< P\4 r ) = {ykri>(«i!^} + (<p\a 2 }(a 2 \ir) (2.141) 

Thus, it a particle is in the state |i \fr) and a measurement is carried out, the probability 
of finding the particle in the state | <p) can be written as 

\(<PW\ 2 = \{(p\ai){iti\i/r) + {(p\a 2 ) {a 2 \^r)\ 2 (2.142) 

Note that the amplitudes and {(p\a 2 ) (a 2 \\/s) can interfere with each other. 

Equation (2.142) presumes that no measurement of the observable A has actually 
taken place. If we were to actually insert a device that measured the observable A 
for the state \\[r), we would then find the probability to obtain the state \<p) given by 

Kf , l«i)| 2 |(a ] |i/ f >i 2 + \{<p\a 2 )\ 2 \(a 2 \^)\ 2 (2.143) 
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which is just the sunj of the probabilities of finding |i Jr) in the states |a ( ) and \a 2 ) times 
the probability that each of these states is found in the state \<p). Equations (2.142) and 
(2.143) illustrate one of the fundamental principles of quantum mechanics: When 
we do not make a measurement that permits us to distinguish the intermediate states 
|aj) and j a 2 ), we add the amplitudes and then square to get the probability, while if 
we do make a measurement that can distinguish which of the states |rq) and |a 2 ) the 
particle is in, we add the individual probabilities, not the amplitudes. For a specific 
example, see the discussion at the end of Section 2.3. 

A convenient shorthand notation is to use the eigenstates |«j) and \a 2 ) as a basis 
and represent a ket such as (2.135) by a column vector 


IV/)- 

faj)-ja2> basis 



(a ilVO \ 
(a% IV/) / 


(2.144) 


a bra by a row vector 


(V'l- 

\a\)-\ a 2) basis 

and an operator by a matrix 

B 


> = {Wfa), (t\a 2 )) (2.145) 


{ax\B\a x ) (u!lfi|a z )\ 
Q|)-jti2) basis V (a 2 \B\a\) (a 2 \B\a 2 ) J 

In this notation, an equation such as 


becomes 


B\i>) — W) 


(a]\B\ai) {ai\B^a 2 ) \ f (a\W) 
(a 2 \B\ai) (a 2 \B\a 2 ) ) V {a 2 \f) 


t {a x \ip) 
V {a 2 \<P) 


(2.146) 


(2.147) 


(2.148) 


Knowing the matrix elements {a { \B\a:) pennits us to evaluate the action of the 
operator B on any state \ xfr). As an example, we can use matrix mechanics to evaluate 
the expectation value of B in the state \ijr): 


{B) = W\W) = (W\a l ), ma 2 )) 


{ai\B\a { } {a x \B\a 2 ) 

{a 2 \B\a x ) {a 2 \B\a 2 ) 


(ail f) \ 
{aiW) / 
(2.149) 


where the last step follows from inserting the identity operator (2.138) between the 
bra (\jf\ and the operator B and between the operator B and the ket \\jr). Finally, note 
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that if basis states are the eigenstates of the operator, the matrix representation is 
diagonal with the eigenvalues forming the diagonal matrix elements: 13 

( a x 0 \ 

A -x (2.150) 

l«l>"l«2> basis V 0 «2 / 

All of the results (2.135) through (2.150) can be extended in a straightforward 
fashion to larger dimensional bases, as introduced in Section 1.6. For example, the 
identity operator is given by ]£„ \ a„)(a n \ in the more general case. 

Problems 


2.1. Show that 


lim 

fV-ACC 



— e 


X 


by comparing the Taylor series expansions for the two functions. 


2.2. Use Dirac notation (the properties of kets, bras, and inner products) directly 
without explicitly using matrix representations to establish that the projection oper¬ 
ator P + is Hermitian. Use the fact that P\ = P + to establish that the eigenvalues of 
the projection operator are 1 and 0. 

2.3. Determine the matrix representation of the rotation operator ft(0k) using the 
states |+z) and |— z) as a basis. Using your matrix representation, verify that the 
rotation operator is unitary, that is, it satisfies R^ ((pk) R((pk) = 1. 

2.4. Determine the column vectors representing the states |+x) and |—x) using the 
states |+y) and |— y) as a basis. 

2.5. What is the matrix representation of J z using the states |+y) and | —y) as a 
basis? Use this representation to evaluate the expectation value of S z for a collection 
of particles each in the state |— y). 

2.6. F.valuate /?(0j)|+z), where /?(0j) = e~ ,J y e ^ h is the operator that rotates kets 
counterclockwise by angle 9 about the y axis. Show that R (f j)|+z) = |+x). Sug- 


15 In general, there are an infinite number of sets of basis states that may be used to form 
representations in matrix mechanics. For example, in addition to the states |±z), the states |±x) 
can be used as a basis to represent states and operators for spin-| particles. However, since |±x) 
are not eigenstates of J,, the matrix representation of this operator using these states as a basis is 
not diagonal, as (2.100) shows. 
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gestion: Express the.ket |+z> as a superposition of the kets |+y) and |—y} and take 
advantage of the fact that J |±y) = (±fi/2)|±y); then switch back to the |+z)-| —z) 
basis. 

2.7. Work out the matrix representations of the projection operators P, = |+z)(+z| 
and = j— z>{—z| using the states |+y) and |—y) of a spin-^ particle as a basis. 
Check that the results (2.51) and (2.52) are satisfied using these matrix representa¬ 
tions. 


2.8. The column vector representing the state \\jr) is given by 

\f) —> 4 = ( ' ) 

S z basis v /5 \ 2 / 

Using matrix mechanics, show that |i Jr) is properly normalized and calculate the 
probability that a measurement of S x yields h/2. Also determine the probability that 
a measurement of S Y yields h/2. 


2.9. Suppose in a two-dimensional basis that the operators A and B are represented 
by the 2 x 2 matrices 



Show that (AS) f = £ t A t . 



6 

8 


2.10. Determine the matrix representation of J x in the S, basis. Suggestion: Start 
with the matrix representation of the operator S x using the states 


I+X) = -4l+z) + ~ |-z) 

x/2 y/2 


7t +1) - 


as a basis and then transform to the £ z basis. 


2.11. The column vector representing the state \\fr) is given by 

7^(72) 

Use matrix mechanics and the result of Problem 2.10 to determine (S x ) for this state. 


2.12. A photon polarization state for a photon propagating in the z direction is 
given by 


IVO = 



V3 


|y> 
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(a) What is the probability that a photon in this state will pass through an ideal 
polarizer with its transmission axis oriented in the y direction? 

(b) What is the probability that a photon in this state will pass through an ideal 
polarizer with its transmission axis y' making an angle <j> with the y axis? 

(c) A beam carrying N photons per second, each in the state \tjr}, is totally 
absorbed by a black disk with its normal to the surface in the z direction. 
How large is the torque exerted on the disk? In which direction does the 
disk rotate? Reminder: The photon states | R) and | L) each carry a unit H of 
angular momentum parallel and antiparallel, respectively, to the direction of 
propagation of the photons. 

(d) How would the result for each of these questions differ if the polarization state 
were 

1V,,) = vf x) + 7i ly) 

that is, the “i” in the state \tfr) is absent? 

2.13. A system of N ideal linear polarizers is arranged in sequence, as shown in 
Fig. 2.13. The transmission axis of the first polarizer makes an angle of <j)/N with 
the y axis. The transmission axis of every other polarizer makes an angle of (p/N 
with respect to the axis of the preceding one. Thus, the transmission axis of the final 
polarizer makes an angle cf> with the y axis. A beam of y-polarized photons is incident 
on the first polarizer. 

(a) What is the probability that an incident photon is transmitted by the array? 

(b) Evaluate the probability of transmission in the limit of large N. 

(c) Consider the special case with the angle <p — 90°. Explain why your result is 
not in conflict with the fact that { x\y) = 0. 16 

2.14. 

(a) Determine a 2 x 2 matrix § that can be used to transform a column vector 
representing a photon polarization state using the linear polarization vectors 
|x) and \y) as a basis to one using the circular polarization vectors \R) and 
|E) as a basis. 

(b) Using matrix multiplication, verify explicitly that the matrix § that you found 
in (a) is unitary. 


16 A nice discussion of the quantum state using photon polarization states as a basis is given 
by A. P French and E. F. Taylor. An Introduction to Quantum Physics, Norton. New York. 1978, 
Chapters 6 and 7. Problem 2.9 is adapted from this source. 
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Figure 2.13 An array of /V linear polarizers. 

2.15. Evaluate the matrix elements 

/ <*|i z |x> <x|J 2 |y> \ 

V(y|i z |x> (y|i z |y) / 

by expressing the linear polarization states |x) and |y) in terms of the circular 
polarization states \R) and \L). Compare your result with that given in Example 2.8. 

2.16. Use both the matrix representations of the angular momentum operator J z from 
Example 2.8 to determine the expectation value of the angular momentum for the 
photon state a\R ) + b\L). 

2.17. Use the matrix representation of the rotation operator R(<pk) in the |x)-|y) 
basis as given in (2.113) to establish that the photon circular polarization states 
(2.114), expressed as column vectors in the |x)-|y) basis, are eigenstates of the 
rotation operator with the eigenvalues that appear in (2.116) and (2.119). 

2.18. Construct projection operators out of bras and kets for .v-polarized and y- 
polarized photons. Give physical examples of devices that can serve as these pro¬ 
jection operators. Use (a) the properties of bras and kets and (b) the properties of 
the physical devices to show that the projection operators satisfy C 2 = P x , P v 2 = P y , 
and P x P y - P y P x = 0. 

2.19. Show' that J z = h\R){R\ - ft\L)(L\ for photons. 

2.20. What is the probability that a right-circularly polarized photon will pass 
through a linear polarizer with its transmission axis along the x' axis, which makes 
an angle <p with the x axis? 
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2.21. Linearly polarized light of wavelength 5890 A is incident normally on a 
birefringent crystal that has its optic axis parallel to the face of the crystal, along 
the x axis. If the incident light is polarized at an angle of 45° to the x and y axes, 
what is the probability that the photons exiting a crystal of thickness 100.0 microns 
will be right-circularly polarized? The index of refraction for light of this wavelength 
polarized along y (perpendicular to the optic axis) is 1.66 and the index of refraction 
for light polarized along x (parallel to the optic axis) is 1.49. 

2.22. A beam of linearly polarized light is incident on a quarter-wave plate with its 
direction of polarization oriented at 30° to the optic axis. Subsequently, the beam 
is absorbed by a black disk. Determine the rate at which angular momentum is 
transferred to the disk, assuming the beam carries N photons per second. 


2.23. 

(a) Show that if the states \a n ) form an orthonormal basis, so do the states U\a n ), 
provided U is unitary. 

(b) Show that the eigenvalues of a unitary operator can be written as e' 0 . 

2.24. The Hermitian operator A corresponding to the observable A has two eigen¬ 
states |uj) and \a 2 ) with eigenvalues a { and a 2 , respectively. Assume ^ a 2 . Show 
that A can be written in the form 

A -a,|o 1 )(a 1 | +a 2 \a 2 )(a 2 \ 


and that 


W\A\f)**iA) 
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CHAPTER 3 

Angular Momentum 


In this chapter we will see that the order in which we cany out rotations about differ¬ 
ent axes matters. Therefore, the operators that generate rotations about these different 
axes do not commute, leading to commutation relations that may be viewed as the 
defining relations for the angular momentum operators. We will use these commuta¬ 
tion relations to determine the angular momentum eigenstates and eigenvalues. We 
will also see that the spin-^ states that have occupied much of our attention so far 
appear as a particular case of this general analysis of angular momentum in quantum 
mechanics. 


3.1 Rotations Do Not Commute and Neither Do the Generators 


Take your textbook and set up a convenient coordinate system centered on the book, 
as shown in Fig. 3.1. Rotate your text by 90° about the x axis and then rotate it by 90° 
about the y axis. Either note carefully the orientation of the text or, better still, borrow 
a copy of the text from a friend and perform the two rotations again, but this time 
first rotate about the y axis by 90° and then about the x axis by 90°. The orientations 
of the two texts are different. Clearly, the order in which you carry out the rotations 
matters. We say that finite rotations about different axes do not commute. 

In Section 2.7 we determined the matrix S that transforms a basis set of polar¬ 
ization states to another set that are related to the initial set by a rotation by angle <p 
counterclockwise about the z axis. The matrix (2.112) is also the matrix that is used 
to rotate the components of an ordinary vector in the x-y plane. Our familiarity with 
this example makes it a good one to use to analyze in more detail what happens when 
we make rotations about different axes. Rather than working directly with the actual 
operators that perform these rotations in our quantum mechanical vector space, we 
will initially work in a specific representation and infer from the behavior that we 
see some fundamental properties about the operators themselves. The results we are 

75 
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Figure 3.1 Noncornmutativity of rotations. A book, shown 
in (a), is rotated in (b) by 90° around the .r axis, then 90° about 
the v axis; in (c) the order of the rotations is reversed. 



Figure 3.2 Rotating vector A into vector A' by angle <p counterclockwise 
about (a) the z axis, (b) the x axis, and (c) the y axis. For simplicity, only 
the components of the vector in the plane perpendicular to the axis of 
rotation are shown. 


interested in depend on the three-dimensional structure of space and are properties 
that manifest themselves in all nontrivial representations. 

Let’s consider an ordinary three-dimensional vector A and a vector A' that is 
obtained by rotating A counterclockwise by an angle <p about the z axis. How are 
the components of A and A' related to each other? Denoting by 9 the angle between 
the projection of A in the x-y plane and the x axis, as in Fig. 3.2a, we have 


A' x = y'/L + A^ cos(<^> + 9) = y A 2 + A* (cos <p cos 9 — sin (j> sin 9) 

= A x cos (p — A v sin cp (3.1a) 


A' y = yA2 + A“ sin ((p + 9) — JA 2 x + A 2 v (sin <p cos 9 + sin 9 cos <p ) 

= A x sin <p + A y cos (p (3.1 b) 

A'=A X (3.1c) 
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or, in matrix form. 


(K) 


/ COS 0 

— sin 0 

°\ 

(A x 

K 

= 

sin 0 

COS 0 

0 

Ay 

\A'J 


\ o 

0 

1 

Uz 


(3.2) 


Thus the matrix that rotates the vector by angle <p counterclockwise about the z axis 
is given by 


S(0k) = 


/ COS 0 
sin 0 


V 0 


— sin 0 0 \ 

cos 0 0 

0 1 / 


(3.3) 


The 2x2 matrix in the upper left-hand corner is just the matrix (2.112). Because 
we are dealing here with a vector that has three components, the rotation matrix is 
a 3 x 3 matrix instead of the 2 x 2 matrix that we found for rotating polarization 
states. The additional elements in this matrix (3.3) simply show that the component 
of the vector in the z direction is unaffected by a rotation about the z axis. 

We consider the special case where the angle is a small angle A0 and retain terms 
in the Taylor series expansions for sin A0 and cos A0 through second order. It is 
necessary to work to at least this order to see the noncommutativity of the rotations. 
Thus 


S(A0k) = 


/ 1 - A0 2 /2 
A 0 

V 0 


-A0 0\ 

1 - A0 2 /2 0 

0 1 ) 


(3.4) 


From Fig. 3.2b we see that for a rotation about the x axis by angle 0, the matrix 
for the rotation can be obtained from the matrix (3.3) by letting x -> y, y —> z, and 
z —> x, that is, by a cyclic substitution. Therefore, the rotation matrix is 


S(0i) 


/ 1 0 0 \ 

'"0 cos0 — sin0 
\o sin 0 cos 0 / 


and consequently 


S( A0i) = 


/ 1 

0 

\0 


0 0 \ 

1 — A0 2 /2 —A 0 

A0 1 - A0 2 /2 


(3.5) 


(3.6) 


Finally, we can obtain the matrix for a rotation about the y axis from the matrix for 
a rotation about the x axis by another cyclic substitution (see Fig. 3.2c). Thus 


r 

S(A0j) = 


Aip 2 /2 

0 


\ -A0 


0 

1 

0 


A 0 \ 

0 

1 - A0 2 /2 ) 


(3.7) 
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We now consider a rotation by A<p about the y axis followed by a rotation by the 
same angle about the x axis. We subtract from it a rotation about the x axis followed 
by a rotation about the y axis. Multiplying the matrices (3.6) and (3.7), we obtain 



/ o 

— A <p 2 

0\ 

S(A$i)S(A0j) - S(A<,6j)S(A0i) = 

A (p 2 

0 

0 


V 0 

0 

Oy 


= S(A(p\) — 1 (3.8) 

where in the last step we have taken advantage of the explicit form of the matrix 
(3.4) when the rotation angle is A/p 2 and terms through order A <p 2 are retained. 

From Section 2.5 we know that these S-matrices are the matrix representations 
of the rotation operators. For example, the matrix (3.3) is the representation of the 
rotation operator R(<pk) in a particular basis. 1 Equation (3.8) shows that when we 
retain terms through second order in A<f>, the operators themselves do not commute. 
Recall from (2.32) that the operator that rotates states by angle <p about the z axis is 

R(4> k)=<r*^ /a (3.9) 

where J z is the generator of rotations. We can think of this as a special case of the 
more general rotation operator 


R((pn) = e~ i3 ' n(t,ffi (3.10) 

that rotates states by angle (p about the axis defined by the unit vector n. Thus the 
operators that rotate states by angle (p about the x axis and the y axis are given by 

R{(f>[)=e~ iiM% and R{<p j) = «r'V /B (3.11) 

with generators J x and J respectively. Thus, if we take the angle of rotation to be 
the small angle A (p and expand the rotation operators through second order in A <p, 
(3.8) tells us that 


1 Although we have phrased our discussion so far in terms of how ordinary vectors change 
under rotations, we are effectively using spin-1 states like the ones we saw in Section 2.7 as a 
basis, but with three states instead of just the two states that are necessary to describe photon 
polarization. We argued in that section that the way the photon polarization states changed under 
rotation told us that photons are spin-1 particles. If photons traveling in the z direction were to 
have a \z) polarization state as well as |x) and |y), this | z) polarization state would not be changed 
by performing a rotation about the z axis, and the matrix representation of the rotation operator 
R(<pk) using the \x), \y), and jz) states as a basis would look like (3.3) instead of (2.113). Later 
in this chapter we will see how spin-1 states do form a three-dimensional basis. Again, particles 
like photons that move at c require special treatment. 
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-'fP-i } 

i 

<1 

<< 

t 

( 

h 2 1 

{ h ) 


ijyAcfi 1 / J y A</> \ 

~ 2 \ h ) 


iJ y A(p l / J y A(p\ 2 

~h. 2 \ H J 


iJ x A<j> 

h 


J x A(p 


2 



(3.12) 


The lowest order nonvanishing terms involve A</> 2 . Equating these terms, we obtain 


J x Jy - JyJ x = ihJ, (3.13) 

or 

[j x ,J y \ = ihJ : (3.14a) 


where the left-hand side of the equation is called the commutator of the two 
operators J x and The commutator of two operators is just the product of the 
two operators subtracted from the product of the two operators with the order of 
the operators reversed. Notice how Planck's constant enters on the right-hand side 
of (3.14a). 

If we were to repeat this whole procedure for rotations about the v and z axes 
and for rotations about the z and x axes, we would obtain two other commutation 


relations related to (3.14a) by the cyclic permutation x —+■ y, y - 

> z, and z —x x : 

1 -1 y- 'l - 1 — ihJ x 

(3.14b) 

and 


./ A | = ifljy 

(3.14c) 


It would be difficult to overemphasize the importance of these commutation 
relations. In Section 3.3 we will see that they alone are sufficient to determine the 
eigenstates and the eigenvalues of the angular momentum operators. So far, our 
arguments to establish that these generators of rotations should be identified with 
the angular momentum operators are probably at best suggestive. The proof is in the 
results and the comparison with experiment. 

Later we will see that the orbital angular momentum operators 

L = rxp (3.15) 


also obey these same commutation relations, that is, for example, 

[L x ,L y \ = ihL z (3.16) 
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However, we have not introduced angular momentum operators through (3.15), but 
rather simply as the generators of rotations. Although this approach may seem more 
abstract and initially less physical, it is also more general and, in fact, essential. In 
Chapter 9 we will see that the eigenvalues of orbital angular momentum, as defined 
by (3.15), do not include the half-integral values that characterize spin-4 particles 
such as electrons, protons, neutrons, and neutrinos. 

3.2 Commuting Operators 


The commutation relations of the generators of rotations show that the generators 
of rotations about different axes do not commute with each other. As we saw in 
Chapter 2, these generators are Hermitian operators. Before turning our attention 
toward solving the angular momentum eigenvalue problem, we need to ask what 
happens when two operators do commute. Consider two such linear Hermitian 
operators A and B that satisfy 

[A, B] = AB - BA = 0 (3.17) 

Suppose there exists only a single state | a) that is an eigenstate of A with eigen¬ 
value a: 


A\a)=a\a) (3.18) 

If we apply the operator B to (3.18), we obtain 

BA\a) = Ba\a) (3.19) 

On the left-hand side we take advantage of (3.17) and on the right-hand side we take 
advantage of the fact that B is a linear operator to write 

AB\a)=aB\a) (3.20a) 

or 

A(B\a))=a(B\a)) (3.20b) 

where we have inserted the parentheses to isolate the state B\a) on both sides. 

Equation (3.20) says that the state B\a) is an eigenstate of the operator A with 
eigenvalue a. Since we have presumed there is only one such state, we conclude 
that 


B\a) = b\a) (3.21) 

where b is a constant, since if \a) satisfies (3.18), so does b\a) for any constant b. 
But (3.21) says that |a) is an eigenstate of B as well with eigenvalue b. Therefore, 
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E = pV2m E = p 2 /2m Figure 3.3 A free particle with momentum p has the 

* p " -p * same energy as one with momentum — p. 

we can relabel the state | a) as \a, b) to show both of the eigenvalues and say that 
A and B have the eigenstate \a, b) in common. An example of a state that can be 
labeled by two eigenvalues is the state | E, p) of a free particle in one dimension, 
where E is the energy and p is the momentum of the particle. 

If there is more than one eigenstate of the operator A with eigenvalue a , we say 
that there is degeneracy. Our proof has established that each eigenstate of A is also 
an eigenstate of B for those states that are not degenerate. If there is degeneracy, 
one can always find linear combinations of the degenerate eigenstates of A that 
are eigenstates of the Hermitian operator B. Thus two Hermitian operators that 
commute have a complete set of eigenstates in common. This result follows from the 
fundamental spectral theorem of linear algebra. We will not prove it here, but we will 
have a number of opportunities in later chapters to verify that it holds in special cases. 
In fact, the example of the one-dimensional free particle can serve as an illustration, 
since for a particular energy E = p 2 /2m there is two-fold degeneracy: the states 
\E, p) and |£, —p) have the same energy but momenta p and -p. respectively, 
corresponding to a particle moving to the right or the left (see Fig. 3.3). Note that 
you can certainly form states that are superpositions of the states | E,p) and | E,—p) 
(such as standing waves), so states with a definite energy need not have a definite 
momentum. 


EXAMPLE 3.1 Equation (2.113) gives the matrix representation 


/ (x|/?(0k)|x) (x|/?(0k)|y) \ _ / cos 0 -sin0 \ 

V (y|£(0k)|x) (y\R(<pk)\y) ) \sin0 cos0 / 

■ t- 

of the rotation operator R(<p k) using the linear polarization vectors |x) and 
|y) for photons as a basis. Example 2.8 shows that 



in the same basis. Show that these operators commute and therefore have 
eigenstates in common. What are these eigenstates and what are the matrix 
representations for R(<pk) and 7, using these eigenstates as a basis? 

SOLUTION It is straightforward to verify that these operators commute: 


-”"*) n(° 3 -fi/° 

V sin 0 cos 0 / v i Of \i 0 


( cos 0 — sin 0 

sin 0 cos 0 


= 0 
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We know from Section 2.7 that the eigenstates of J 7 are the circular polar¬ 
ization states \R) and | L) with eigenvalues h and —ft,, respectively. Conse¬ 
quently, as given in Example 2.8, 

/(R\J Z \R) (R\J z \L)\ = /h 0 \ 

\{L\J Z \R) (L\J Z \L) ) VO -h) 

Since R(tp k) = we a i so see that 

/ (R\R(cpk)\R) (R\R(<f,k)\L)\_Ce^ 0\ 

V {L\R(<f>k)\R) (L\R(<PV\L) )~ V 0 e 1 *) 

consistent with the fact that these two operators have the eigenstates \R) and 
\L) in common. Using these eigenstates as a basis, the matrix representations 
of both operators are diagonal with the corresponding eigenvalues as the 
diagonal matrix elements. 


3.3 The Eigenvalues and Eigenstates of Angular Momentum 


Although the commutation relations (3.14) show us that the generators of rotations 
about different axes do not commute with each other, the operator 

J 2 = j. j = / 2 + ./ 2 +/ 2 (3.22) 

does commute with each of the generators. 2 In order to verify this, we choose 
the generator of rotations about the z axis, and use the identity (see Problem 3.1) 

[A, BC] = B[A, C] + | A, B]C (3.23) 


to obtain 3 


\LP +i 2 +/?|: 


\j z , ,/ 2 | + \j z , P\ + [7 Z , y 2 j 


— J X [J Z , 3 X \ + [J z , J X ]J X + JylL, Jy ] A [j_, Jy\Jy 

— ih(J x Jy + JyJ X — Jy J X — J X Jy) = 0 (3.24) 


2 The operator .1 = ./ v i f ./. j + 7.k is a vector operator. For vector operators such as j we 
use the notation j 2 = (J x i + ,/ v j + j z k) ■ ( J x i + i v j + J z k) = J 2 + JJ + J;. 

3 We will use commutator identity (3.23) as well as its analogue |A/?, C] = A\B, C]+ 
[4, C\B often when evaluating a commutator that involves a product of operators. In general, 
this is much easier than starting by expanding the commutator using the defining relationship 
14, BC) — ABC — BCA. You are encouraged to work out Problem 3.1 so you feel comfortable 
with these commutator identities. 
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Because the operator J 2 commutes with J z , these operators have simultaneous 
eigenstates in common. We label the kets |X, m), where 

J 2 \k, m) = kh 2 \k, m) (3.25a) 

J z \k, m) = mh\k, m) (3.25b) 

We have explicitly included the dimensions of the operators in the factors of ti so 
that /. and m are dimensionless. Thus a. m) is a state for which a measurement of 
the z component of the angular momentum yields the value mil and the magnitude 
squared of the angular momentum is kh 2 . 

We can see that k > 0, as we would expect physically since k specifies the 
magnitude squared of the angular momentum in the state |k, m). Consider 

{k, m|J 2 |A, m) = kfi 2 {k, m\k, m) (3.26) 

Like all physical states, the eigenstates satisfy ( k , m\k, m) = 1. A typical term in 
the left-hand side of (3.26) is of the form 

(k, m\J 2 \k, m) = Or\iff) (3.27) 

where we have defined |A. m) = |i (r), and (\ff \ = {k, m\J x since J x is Hermitian. 
Although the ket |t/r) is not normalized, we can always write it as | ijr} = c\<p), where 
c is a complex constant (that must have the dimensions of h) and | cp) is a physical 
state satisfying (<p\<p) = 1. In other words, the action of the operator J x on a ket vector 
must yield another ket vector that belongs to the vector space. 4 Since I — c*{(p\, 
we see that (i/rl^) = c*c{(p\<p) > 0, where the equality would hold if c = 0. Our 
argument that (3.27) is positive semidefinite holds for each of the three pieces [see 
the form (3.22) of j 2 | on the left-hand side of (3.26), and therefore k > 0. 


AN EXAMPLE: SPIN 1 t 

'•.-Sss* • 

To illustrate what we have discovered so far and suggest the next step, let’s take the 
specific example involving the following three 3x3 matrices: 


/ 0 1 0 



A 

A 


/0 -i 

° 1 




0 

0 1 

i 0 

—i 

l- 

> h 

0 

0 

0 

\0 i 

0 ) 



U 

0 -\) 


(3.28) 


4 Because J x is the generator of rotations about the x axis, the ket (1 — i J x d<p/fi) |A, m) is just 
the ket that is produced by rotating the ket |A, m) by angle d<j> about the x axis. Thus the ket |i (r) 
can be viewed as a linear combination of the rotated ket and the ket | X,m), that is, a superposition 
of two physical states. 
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For now, don’t worry about how we have obtained these matrices. Later in this 
chapter we will see how we can deduce the form of these matrices (see Example 3.3 
and Problem 3.14). In the meantime, let’s see what we can learn from the matrices 
themselves. 

To begin, how can we be sure that these three matrices really represent angular 
momentum operators? Following our earlier discussion, it is sufficient to check (see 
Problem 3.13) that these matrices do indeed satisfy the commutation relations (3.14). 
We next calculate 


J 2 = j. j = j* + y f + jt -> 2 Tr 



(3.29) 


We see explicitly that J 2 is just a constant times the identity matrix and thus com¬ 
mutes with each of the components of j. The operator J z is diagonal as well, sug¬ 
gesting that the matrix representations (3.28) are formed using the eigenstates of 
J z as well as J 2 as a basis. The column vectors representing these eigenstates are 
given by 5 


/1\ 

(°\ 

f°\ 

0 

1 I and 

0 

W 

w 

w 


(3.30) 


which have eigenvalues h, 0, and - fi, respectively, as can be verified by operating 
on them with the matrix representing J„. For example. 



/1 

O 

O 



( l \ 

h 

0 

0 0 

0 

= h 

° 


U 

0 -1/ 

W 


W 


(3.31) 


Similarly, we see that each of these states is an eigenstate of J 2 with eigenvalue 2 h 2 . 

Since the matrix representations of J x and J y are not diagonal, the states (3.30) 
are not eigenstates of these operators. It is straightforward to evaluate the action of 
the operators J x and J y on the basis states. There is, however, a linear combination 
of these two operators, namely. 



(° 

1 

°\ 

J x + i Jy —> V2h 

0 

0 

1 


u 

0 

0/ 


(3.32) 


whose action on the basis states exhibits an interesting pattern. Applying this oper¬ 
ator to the basis states (3.30), we obtain 


5 Compare these results with (2.70). (2.71), and (2.72) for a spin-? particle. 
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m 

(0 

1 

°\ 

(°\ 

sfin 

0 

0 

1 

0 


u 

0 

0) 

w 


(° 

1 

°\ 

(0\ 

V2ft 

0 

0 

1 

' 


U 

0 

0 ) 

VO/ 


/ 0 \ 


n 


h 


s/ih 


(0 

1 



/ 

0 

0 

A 

0 

= 

V0 

0 

0/ 

UJ 

V 


(3.33) 


(3.34) 


(3.35) 


Thus, according to (3.33), the operator J x + iJ acting on the state with eigenvalue 
—ft. for J z turns it into a state with eigenvalue 0, multiplied by v2ft. Similarly, as 
(3.34) shows, when the operator acts on the state with eigenvalue 0 for J z , it turns it 
into a state with eigenvalue ft, multiplied by -Jlfi. This raising action terminates 
when the operator J x + i J y acts on the state with eigenvalue ft, the maximum 
eigenvalue for J z . See (3.35), ft can be similarly verified that the operator 



s/2h 


/0 0 0 \ 
1 0 0 
Vo 1 0/ 


(3.36) 


has a lowering action when it acts on the states with eigenvalues ft and 0, turning 
them into states with eigenvalues 0 and —ft, respectively. In this case, the lowering 
action terminates when the operator (3.36) acts on the state with eigenvalue -ft, the 
lowest eigenvalue for J z . 


RAISING AND LOWERING OPERATORS 

Let’s return to our general analysis pf angular momentum. The example suggests 
that it is convenient to introduce the two operators 

J ± ~J x ±iJ y (3.37) 

in the general case. Notice that these are not Hermitian operators since 

yt = yf + (_,•)yf = J x - ij y = L (3.38) 

The utility of these operators derives from their commutation relations with J z : 

[J z , y ± ] = [J z , J x ± i J y \ = ihJ y ± i{-ihJ x ) = ±h.J± (3.39) 

To see the effect of J + on the eigenstates, we evaluate J z J + \k,m). We can use the 
commutation relation (3.39) to invert the order of the operators so that J z can act 
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directly on its eigenstate |A., m). However, since the commutator of ,/ z and ,/ + is not 
zero but rather is proportional to the operator J + itself, we pick up an additional 
contribution: 

J Z J + 1A, m) = (J + J z + HJ + ) |A, m) 

= ( J + mh + hJ + )\X, m) 

= (:m + l)fi/ + |A, m) (3.40a) 

Inserting some parentheses to help guide the eye: 

J Z (J+ |A, m)) = (m + \)h{J + \k, m)) (3.40b) 

we see that J + 1A, m) is an eigenstate of with eigenvalue ( m + 1 )h. Hence J + is 
referred to as a raising operator. The action of J + on the state |A, m) is to produce 
a new state with eigenvalue (m + 1 )h. 

Also 


J Z J_ |A, m) — (J_J Z — m) 

= ( J_mh — HJ_) |A, m) 

= (m — m) (3.41a) 

Again, inserting some parentheses, 

J Z (J- 1A, m )) = (m — m )) (3.41b) 

showing that J_\k, m) is an eigenstate of J z with eigenvalue (m — 1 )h\ hence ./_ 
is a lowering operator. Notice that since J + and J_ commute with J 2 , the states 
J ± |A, m) are still eigenstates of the operator J 2 with eigenvalue kft 2 : 

j 2 (y ± |A, m)) = / ± j 2 |A, m) = kh 2 (J ± \k, m >) (3.42) 


THE EIGENVALUE SPECTRUM 

We now have enough information to determine the eigenvalues k and m, because 
there are bounds on how far we can raise or lower m. Physically (see Fig. 3.4), we 
expect that the square of the projection of the angular momentum on any axis should 
not exceed the magnitude of J 2 and hence 

m 2 < k (3.43) 


Formally, since 


(k,m\(J 2 + J 2 )\k,m) >0 


(3.44) 
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z 



Figure 3.4 The projection of the angular momentum on the 
axis never exceeds the magnitude of the angular momentum. 
Caution: This is a classical picture; the angular momentum 
cannot point in any definite direction. 


we have 


(X, m|(j 2 — J 2 )\X, m) = (X - m 2 )h 2 {X , m\X, m) > 0 (3.45) 


establishing (3.43). 

Let’s call the maximum m value j. Then we must have 


4 |A.,7>=0 


(3.46) 


since otherwise J + would create a state |A, j + 1), violating our assumption that j is 
the maximum eigenvalue for J z . 6 Lfsing 


J- J+ = (J x ~ i Jy)(J x + 1 A) 

= 4 + Jy + ill , Jy] 

= J 2 - J 2 - hJ z 

we see that 

Jj+ \X, j) = (J 2 - J 2 - hJ z )\X, j) 

= kb- J 2 - j)& 2 1*» j) =° 

or a = j (j + 1). 

Similarly, if we call the minimum m value j', then 


(3.47) 


(3.48) 


i_|A, /) = o 


(3.49) 


and we find that 

j+L\x, f) = a 2 - J 2 + nj z )\xj') 

= (X - f 2 + j')H 2 |X,/) = 0 (3.50) 


6 Equation (3.35) demonstrates how this works for the special case of spin 1. 
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7 

7-1 

7-2 


- --j + 2 

j + 1 Figure 3.5 The possible m values for a fixed magnitude 
- -j Jj (j + l)fi of the angular momentum. 

In deriving this result, we have used 

J | j„ = ( j X + i jy)(J x — 1 Jy ) 

= ■/,' + Jy _ Jy] 

= J 2 -jJ + hJ z (3.51) 

Thus k = j' 2 — j'. The solutions to the equation j 2 + j — j' 2 — j', which results 
from setting these two values of k equal to each other, are j' = — j and j' = j + 1. 
The second solution violates our assumption that the maximum m value is j. Thus 
we find the minimum m value is — j. 

If we start at the m = j state, the state with the maximum m, and apply the 
lowering operator a sufficient number of times, we must reach the state with m = — j, 
the state with the minimum m. If this were not the case, we would either reach a state 
with an m value not equal to —j for which (3.49) is satisfied or we would violate 
the bound on the m values. But (3.49) determines uniquely the value of j' to be — j. 
Since we lower an integral number of times, j — j' = j — (—j ) = 2j = an integer, 
and we deduce that the allowed values of j are given by 

j = 0, —, 1, —, 2, ... (3.52) 

2 2 

As indicated in Fig. 3.5, the m values for each j run from j to — j in integral steps: 

m ~ h ' ~ ^ J ~ 2 ’ ; • •» ~J + ~J , ^ 3 - 53 ) 

2j +1 states 

Given these results, we now change our notation slightly. It is conventional to 
denote a simultaneous eigenstate of the operators J 2 and J z by | j, in) instead of 
|A., m) = | j(j + 1), m). It is important to remember in this shorthand notation that 

J 2 | j, m) = j(j + l)h 2 \j, m) (3.54a) 

as well as 

•417. m) = mh\j , m) (3.54b) 
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-■ 


m- 0 


m = 1/2 
m = - 1/2 


m = 1 
m = 0 
m = - 1 


m = 3/2 
m = 1/2 
m = -1/2 


m = -3/2 


(a) (b) (c) (d) 

Figure 3.6 The m values for (a) spin 0, (b) spin ?, (c) spin 1, and (d) spin 


Let’s examine a few of these states, for which the m values are shown in Fig. 3.6. 

1. The 7 = 0 state is denoted by |0, 0). Since the magnitude of the angular 
momentum is zero for this state, it is not surprising that the projection of the 
angular momentum on the z axis vanishes as well. 

2. The 7 = 2 states are given by ||, |> and ||, — |). Note that the eigenvalues 
of J z for these states are ft/2 and —ft/2, respectively. These states are just the 
states |+z) and |—z) that have concerned us for much of Chapters 1 and 2. 
We now see the rationale for calling these states spin-| states: the constant j 
takes on the value However, the magnitude of the spin of the particle in 

these states is given by J + 1) ft = V?> ft/2. 

3. The angular momentum j — 1 states are denoted by 11, 1), [ 1, 0), and 11, — 1). 
These spin-1 states are represented by the column vectors (3.30) in the example 
of this section. The eigenvalues of J z are ft, 0, and —ft, which are the diagonal 
matrix elements of the matrix representing J z in (3.28). The magnitude of the 
angular momentum for these states is given by V’K I +Tj ft = V2 ft. 

4. There are four j = | states: ||, |), ||, ^), ||, —|), and ||, —|). The magni¬ 
tude of the angular momentum is + 1) ft = x/l5 ft/2. 

i f; 

As these examples illustrate, the magnitude s/Jij + 1) ft of the angular momen¬ 
tum is always bigger than the maximum projection jh on the z axis for any nonzero 
angular momentum. In Section 3.5 we will see how the uncertainty relations for an¬ 
gular momentum allow us to understand why the angular momentum does not line 
up along an axis. 


i- ' *. * ‘ -i-" 1 - -r £2Z~. .s',* •_ • v'fV •• rsrse . . ** f, &.-SS2MF > * 

EXAMPLE 3.2 An atom passes straight through an SGz device without 
deflecting. What can you deduce about the angular momentum of the atom? 

SOLUTION Since the atom is not deflected, it must have J. = 0. Thus the 
atom has an integral value j for its angular momentum, since only for integral 
values of j is m = 0 one of the eigenvalues for J z . 
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3.4 The Matrix Elements of the Raising and Lowering Operators 


We have seen in (3.40) and (3.42) that the action of the raising operator J + on a state 
of angular momentum j is to create a state with the same magnitude of the angular 
momentum but with the z component increased by one unit of h: 

J+\j, m) = c + h\j, m + 1) (3.55) 

while the action of the lowering operator is 

J-\j, m) = c_h\j, m - 1) (3.56) 

It is useful to determine the values of c + and c_. Taking the inner product of the 
ket (3.55) with the corresponding bra and making use of (3.38), we obtain 

O', m\J_J + \j, m) = c* + c + h 2 (j, m + 1| j, m + 1) (3.57) 

Substituting (3.47) for the operators J_J + , we find 

o, m |(J 2 - J 2 - hJ z )\j, m) = [j(j + 1) - m 2 - m]h 2 (j, m\j, m) 

— c* + c + h 2 (j , m + 1|j, in + 1) (3.58) 

Assuming the angular momentum states satisfy (j. m\j, m) = (j, m + l|y, m + 1), 

we can choose c + = Pj(j + 1) — m(m. + 1), or 

j+\j, m) = y/j(j + 1) - m(m + 1) h\j, m + 1) (3.59) 

Note that when m = j, the square root factor vanishes and the raising action termi¬ 
nates, as it must. Similarly, we can establish that 

P\j, m) = vO'O + 1) - m(m - 1) h\j, m - 1) (3.60) 

for which the square root factor vanishes when m = — j, as it must. 

These result s determine the matrix elements of the raising and lowering operators 
using the states | j, m) as a basis: 

07 m'\J + \j, m) = Jj(j + 1) - m(m + I) h O', m'\j, m + 1) 

= v7(7 + 1 ) - m (m + 1 ) h 8 m f m+ 1 (3.61) 

and 

(j, m\J_\j, in) = v 7 j(j + 1) - m{m - 1) h O', m'\j, m - 1) 

= V j(j + 1) - mini - 1) h h m , m _ x (3.62) 

In obtaining these matrix elements, we have made use of (j, m'\j , m) — 8 m > m , since 
the amplitude to find a state having J z — mh with 7, = m'h. m' ^ m, is zero. In 
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Section 3.6 we will see how useful the matrix elements (3.61) and (3.62) are for 
obtaining matrix representations of J x and J y . 


EXAMPLE 3.3 Obtain the matrix representation of the raising and lower¬ 
ing operators using the j = 1 states as a basis. 

SOLUTION The three j = 1 basis states are |1) = |1, 1), |2) = 11, 0), and 
|3) = 11, -1). Using (3.61), we see that J + 11, 1) = 0, ,/ + 11, 0) = ~Jl h\ 1, 1), 
and i + |l, —1) = V2h\l, 0). Thus the only nonzero matrix elements are 
(1|/+|2> = <1, l\J + \l,0) = V2h in the first row, second column and 
(2|7 + |3) = (1, 0|J + |1, —1) = V2 h in the second row, third column: 


J 7 basis 


V2H 


/0 1 0 \ 
0 0 1 
Vo o o/ 


Since J_ — ./}. the matrix representation of J_ is the transpose, complex 
conjugate of the matrix representation for ■>+- 


sflh 


/0 
1 

Vo 


0 O' 
0 0 
1 0 


These results are in agreement with (3.32) and (3.36), showing that the 3x3 
matrix representations in Section 3.3 are indeed those for j — 1. 




3.5 Uncertainty Relations and Angular Momentum 


In solving the angular momentum problem in Section 3.3, we took advantage of the 
commutation relation (3.24) to form simultaneous eigenstates of J 2 and Since 
[J 2 , ./, | = 0 as well, we can also form simultaneous eigenstates of J 2 and J x . For 
the j = | sector, the two eigenstates would be the states |+x) and |— x) that we 
discussed in the earlier chapters. We did not. however, try to form simultaneous 
eigenstates of J 2 , J x , and J z . We now want to show that such simultaneous eigenstates 
are prohibited by the commutation relations of the angular momentum operators 
themselves, such as 


[J x ,J y \ = ihJ z (3.63) 

This is why in Section 3.3 we chose only one of the components of J, together with 
the operator J 2 , to label the eigenstates. 
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The commutation relation (3.63) is an example of two operators that do not 
commute and whose commutator can be expressed in the form 

[A, B]=*iC (3.64) 


where A, B, and C are Hermitian operators. We will now demonstrate that a 
commutation relation of the form (3.64) implies a fundamental uncertainty relation. 
To derive the uncertainty relation, we use the Schwarz inequality 

(a|a)</I|/J}>|<a|/l}| 2 (3.65) 

This is the analogue of the relation (a • a)(b • b) > (a • b) 2 , familiar from the ordinary 
real three-dimensional vector space. See Problem 3.7 for a derivation of (3.65). 

We substitute 

\a) = (A-(A)M) (3.66a) 

|/1) = (B - (B))\f) (3.66b) 

into (3.65), where the expectation values 

(A) = W\A\1r) (3.67a) 

and 

(B) = (f\B\f) (3.67b) 

are real numbers because the operators are Hermitian. Notice that 

ia\a) = W(A - <A» 2 |t/r> = (AA) 2 (3.68a) 

(m = W(B - (B)) 2 W) = (A B) 2 (3.68b) 

where we have used the familiar definition of the uncertainty (see Section 1.4 or 
Section 1.6) and the fact that A and B are Hermitian operators. The right-hand side 
of the Schwarz inequality (3.65) for the states (3.66) becomes 


(a\P) = {f\(A-{A))(B-(B))\1r) 


For any operator O , we may write 


6 = 


0 + 6 ^ 0-6 f 

2 + 2 



(3.69) 


(3.70) 


where F — 6 + and G = —i(6 — O t ) are Hermitian operators. If we take the 
operator 6 to be (A — {A))(B — (B)), we find 


6 - <3 f = [A, B] = iC 


(3.71) 
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and therefore G = C in (3.70). Thus 


MW 


^(f\F\f) + l -(f\C\f) 


M\F\W | |(l/r|C|l/r)| 2 > |(C)1 2 


(3.72) 


where we have made use of the fact that the expectation values of the Hermitian 
operators F and C are real. Combining (3.65), (3.68), and (3.72), we obtain 


(AA) 2 (AB) 2 > -^1 


(3.73) 


or simply 


AAAB 


1(C) | 


(3.74) 


which is a very important result. 

If we apply this uncertainty relation to the specific commutation relation (3.63), 
we find 7 


AJ X AJ V > -|(7. 


(3.75) 


This uncertainty relation helps to explain a number of our earlier results. If a spin-7 
particle is in a state with a definite value of J z , (./.) is either h/2 or —/i/2, which 
is certainly nonzero. But (3.75) says that A J x must then also be nonzero, and thus 
the particle cannot have a definite value of J x when it has a definite value of 
We now see why making a measurement of S z in the Stern-Gerlach experiments is 
bound to modify subsequent measurements of S x . We cannot know both the x and 
the z components of the angular momentum of the particle with definite certainty. 
We can also see why in general the angular momentum doesn’t line up along any 
axis: If the angular momentum were aligned completely along the z axis, both the .r 
and y components of the angular momentum would vanish. We would then know all 
three components of the angular momentum, in disagreement with the uncertainty 
relation (3.75), which requires that both A J x and A J y are nonzero in a state with a 
definite nonzero value of ./,. Thus the angular momentum never really “points” in 
any definite direction. 


7 In Chapter 6 we will see that the position and momentum operators satisfy 




Thus (3.74) leads directly to the famous Heisenberg uncertainty relation Ax A p x > h/2 as well. 
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3.6 The Spin-^ Eigenvalue Problem 


In this section we wili see how we can use the results of this chapter to derive the 
spin states of a spin-j particle that we deduced from the results of Stern-Gerlach 
experiments in Chapter 1. First we will make a small change in notation. It is 
customary in discussing angular momentum to call the angular momentum operators 
J x , J y , and ./. in general. We have introduced these operators as the generators of 
rotations. The commutation relations that we used in Section 3.3 depended only 
on the fact that rotations about different axes do not commute in a well-defined 
way. Our formulation is general enough to include all kinds of angular momentum, 
both intrinsic spin angular momentum and orbital angular momentum. That is one 
of the major virtues of introducing angular momentum in this way. In Chapter 9 
we will see that for orbital angular momentum—angular momentum of the r x p 
type—only integral j's are permitted. If our discussion of angular momentum is 
restricted to purely orbital angular momentum, it is conventional to denote the 
angular momentum operators by L x , L y , and L z . On the other hand, if our discussion 
is restricted to intrinsic spin angular momentum, it is customary to call the spin 
angular momentum operators S x , S Y , and .S'.. Our discussion in Chapters 1 and 2 
of the intrinsic spin angular momentum of particles like electrons and photons was 
restricted to angular momentum of the latter sort. Thus, we could return to Chapter 2. 
where we first introduced the generator of rotations about the z axis, and relabel J z to 
S z , because we were strictly concerned with rotating intrinsic spin states. In addition 
to renaming the operators for intrinsic spin, it is also common to relabel the basis 
states as |.v, m), where 

S 2 1,v, m) = s(s + l)h 2 \s y m) (3.76a) 

S 2 |s, m) = mli\s, m) (3.76b) 

For a spin -5 particle, s = 5 and there are two spin states, |j, 5 ) and |2, 

Before solving the eigenvalue problem for a spin-4 particle, it is useful to 
determine the matrix representations of the spin operators S x , S y , and S z . We will use 
as a basis the states \ \, \) — |+z) and ||, — |) = |—z) that we found in Section 3.3. 
In fact, we already determined the matrix representation of S z in this basis in 
Section 2.5. Of course, we were calling the operator J z then. In agreement with 
(2.70) we have 


/<+z|S 2 |+z) (+z|S 2 |-z)\ = h / 1 0 \ 

V<—z|S.|+z) <-z|S z |—z> ) ~ 2 VO -l) 


in the S, basis. 

In order to determine the matrix representations for S x and S , we start with 
the matrix representations of the raising and lowering operators S + and 5_, whose 
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action on the basis states we already know. Forming the matrix representation in the 
S z basis for the raising operator using (3.61), we have 


- / <+z|S + |+z> <+z|S+|-z>\ = /0 1\ 

+ ^ V(-z|S + |+z> (—z|S + |—z) / _ V 0 0/ 

reflecting the fact that 


S + \+z) = S + \±, |>=0 


(3.78) 


(3.79) 


5 + |-z> = 5 + ||,-|) 

= (l + ! ) ~ (~t) (-5 + l) h\b \) 

= h\\,\) = n\+z) (3.80) 


Also, the matrix representation of the lowering operator in the S, basis can be 
obtained from (3.62): 

. (+,|S_|-,>X /0 ox 

V(-Z|S_|+2) (—z|S _|—%) / \1 0 ) 

reflecting the fact that 

S-l—z) = S_||, -|)=0 (3.82) 

and 

S_l+z) = S_|£, |) 

= h\\, -\) = h\-z) (3.83) 


As a check, note that since S + = S_, we could also obtain (3.81) as the transpose, 
complex conjugate of the matrix (3.78). Recall (2.80). 

With the matrix representations for S + and 5_, determining the matrix represen¬ 
tations of S x and S y is straightforward. Since 


S + = S x + iS y (3.84) 

S_ = S x - iS y (3.85) 
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then 


and 




(3.86) 


(3.87) 


Using the matrix representations (3.78) and (3.81) in the S, basis, we obtain 


and 


h /0 

2 V 1 




(3.88) 


(3.89) 


The three 2x2 matrices in (3.88), (3.89), and (3.77) (without the factors of fi/2) 
are often referred to as Pauli spin matrices and are denoted by a x , a y , and a z , 
respectively. These three equations can then be expressed in the vector notation 

S -> -<r (3.90) 


where S = S x i + S y j + S,k and a = er v i + rr v ,j + cr z k. 

We are now ready to find the eigenstates of S x or S y . In fact, we can use the 
matrix representations (3.90) to determine the eigenstates of S„ = S ■ n and thus find 
the states that are spin up and spin down along an arbitrary axis specified by the unit 
vector n. We will restrict our attention to the case where n = cos </>i + sin </>j lies in 
the x-y plane, as indicated in Fig. 3.7. The choice 0 = 0 (<j> — n/2) will yield the 
eigenstates of S x (S y ) that we used extensively in Chapters 1 and 2. We will leave the 
more general case to the Problems (in particular, see Problem 3.2). We first express 
the eigenvalue equation in the form 


S a \n) = n-\n) (3.91) 

where, as we did earlier in our general discussion of angular momentum, we have 
included a factor of h so that /r is dimensionless. The factor of A in the eigenvalue 
has been included to make things turn out nicely. After all, we know the eigenvalues 
already. Since the eigenvalues of S z are ±h/2 and since our choice of the z axis 
is arbitrary, these must be the eigenvalues of S n as well. Equation (3.91), however, 
does not presume particular eigenvalues, and we will see how solving the eigenvalue 
problem determines the allowed values of /x. 
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z 



Figure 3.7 The spin-up-along-n state, where 
n = cos (pi + sin 0j. 


As in (2.63), we obtain two equations that can be expressed in matrix form by 
taking the inner product of (3.91) with the two bra vectors <+z| and <-z|: 


ft 

9 


0 1 \ /0 -|\ 

) cos 0 “F I sin 0 

10/ Vi 0 / 


{+z|/r> \ _ n / (+z|m) 
< z|/x> / M 2\(-z|m) 


(3.92) 


where the 2x2 matrix on the left-hand side is just the matrix representation of 
S n = S x cos 0 + S y sin 0. Dividing out the common factor of ft/2, we can write this 
equation as 


-M <r f * 
e" 1 ’ -pi 


(+z|A> \ q 

(—z|m) / 


(3.93) 


This is a homogeneous equation in the two unknowns (+z|/x) and (—z|ju.). A non¬ 
trivial solution requires that the determinant of the coefficients vanishes. Otherwise, 
the 2 x 2 matrix in (3.93) has an inverse, and multiplying the equation by the inverse 
would leave just the column vector equal to zero, that is, the trivial solution. Thus 


-H e 
J4> 


-ip 

-ll 


(3.94) 


showing that fi 2 — = /r 2 - 1^= 0, or n = ±1. 

Now that we know the eigenvalues, we may determine the corresponding eigen¬ 
states. The state with pi = +1 is an eigenstate of S n with eigenvalue ft/2. Thus, 
in our earlier notation, it is the state |+n), and we can relabel it accordingly: 

|pi = 1) = |+n). Substituting pi — +1 into (3.93), we find that 

<-z|+n) = e'^(+z|+n) (3.95) 


The requirement that the state be normalized ({+n[+n} — 1) is satisfied provided 
that 


| (+z|+n)| 2 + |(-z|+n)| 2 = 1 


(3.96) 


Substituting (3.95) into (3.96), we find 

2|(+z|+n)| 2 = 1 (3.97) 


Page 113 (metric system) 



98 | 3. Angular Momentum 


Thus, up to an overall phase, we may choose (+z|+n) — 1/V2, which with (3.95) 
shows that (—z|+n) = e'^/\fl, or 


M 


l+n> = v! l+z) + vI 1-2 ’ 


(3.98) 


Note how, up to an overall phase, this result agrees with (2.41), which we obtained 
by rotating the state |+x) by an angle <p counterclockwise about the z axis, namely, 
|+n) = *(0k)|+x). 

The state with /r = -1 is an eigenstate of S„ with eigenvalue — h/2. We can thus 
relabel this state \fx — —\) = |—n). If we substitute the value n = — 1 into (3.93), we 
find that 


{—z|—n) = —e"^(+z|—n) 


Satisfying 


|(+z|-n}| 2 +|(-z|-n}| 2 =l 


(3.99) 


(3.100) 


we obtain 


|—n) 



e i<P 

V2 


-z) 


(3.101) 


These results are in agreement with our earlier forms for these states: setting 
4> = 0 in (3.98) and (3.101) yields 


!±x} = - 7 =|+z}±--|-z) 

v2 V2 


while setting <p = n/2 yields 


l±J) => z>± ^ | - z) 


(3.102) 


(3.103) 


However, in deriving (3.102) and (3.103) here, we have not had to appeal to the 
results from the Stem-Gerlach experiments. We have relied on only the commutation 
relations of the generators of rotations and their identification with the angular 
momentum operators. In a similar fashion, we can work out the spin eigenstates of a 
particle with arbitrary intrinsic spin 5. In this latter case, because there are 2s + I spin 
states for a particle with intrinsic spin s f the corresponding eigenvalue problem will 
involve {2s + 1) x (2s + 1) matrices. The procedure for determining the eigenstates 
and corresponding eigenvalues is the same as we have used in this section, but the 
algebra becomes more involved as the dimensionality of the matrices increases. 
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EXAMPLE 3.4 Determine the matrix representation for .S', using the spin-- 
states as a basis. 

SOLUTION For $ = |, there are four basis states, namely ||, |), ||, |>, 
| j , —\), and 11These four states are eigenstates of S 2 with eigenvalue 
| (| + \)h 2 as well as being eigenstates of ,S, with eigenvalues |fi, 
and — \h, respectively. 

Using 

5 + |5, m) = y/s(s I 1) - m(in + 1) fi\s, m + 1} 


we see that 

S + \l,i) = V3h\ll) 

5 + [|-|) = 

£ + |§,-§) = 73fi||,-£} 

Thus the matrix representation for .S’ + is given by 


(0 

73 

0 

0 ^ 

0 

0 

2 

0 

0 

0 

0 

V3 

u 

0 

0 

o ) 


The matrix representation of S_ is the transpose, complex conjugate of this 
matrix, namely 


L o 

0 

0 

0) 

73 

0 

0 

0 

0 

2 

0 

0 

l 0 

0 

73 

o) 


Thus the matrix representation of S x is given by 


1 

( 0 

73 

0 

0 \ 

i » h 

73 

0 

2 

0 

S X = ~(S + + S~)^~ 

0 

2 

0 

73 


l 0 

0 

73 

0 ) 


99 
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3.7 A Stern-Gerlach Experiment with Spin-1 Particles 


Let’s return to the sort of Stern-Gerlach experiments that we examined in Chapter 1, 
hut this time let’s perform one of these experiments with a beam of neutral spin-1 
instead of spin-| particles. Since the 2 component of the angular momentum of 
a spin-1 particle can take on the three values ft, 0. and —ft, an unpolarized beam 
passing through an SGz device splits into three different beams, with the particles 
deflected upward, not deflected at all, or deflected downward, depending on the value 
of S z (see Fig. 3.8). 

What happens if a beam of spin-1 particles passes through an SGy device? An 
unpolarized beam should split into three beams since S v can also take on the three 
values ft, 0, and -ft. If we follow this SGy device with an SGz device, we can ask, 
for example, what fraction of the particles with S y = ft will be found to have S, = ft 
when they exit the SGz device (see Fig. 3.9)? Unlike the case of spin |, where it was 
“obvious” for two SG devices whose inhomogeneous magnetic fields were at right 
angles to each other that 50 percent of the particles would be spin up and 50 percent 



Figure 3.8 A schematic diagram indicating the paths that a spin-1 
particle with S, equal to ft, 0, or -ft would follow in a Stern-Gerlach 
device. 



Figure 3.9 A block diagram for an experi¬ 
ment with spin-1 particles with two SG de¬ 
vices whose inhomogeneous magnetic fields 
are oriented at right angles to each other. What 
fraction of the particles exiting the SGy device 
with S y = ft exits the SGz device in each of 
the three channels? 
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would be spin down when they exited the last SG device, here the answer is not 
so clear. In fact, you might try guessing how the particles will be distributed before 
going on. To answer this question, we need to calculate the amplitude to find a particle 
with S y = h in a state with S z = H, that is, to calculate the amplitude z (l, 1| 1, l) r 
where we have put a subscript on the ket and bra indicating that they are eigenstates 
of S y and S z , respectively. A natural way to determine the amplitude Z {1, 1| 1, l) y is 
to determine the eigenstates of S y for a spin-1 particle in the S z basis. We use the 


Sy 


h 

V2 


The eigenvalue equation 


(3.28): 



/0 -i 

° 1 


i 0 

—i 

(3.104) 

V0 i 

0 i 


= pth\i, n) 

y 

(3.105) 


becomes the matrix equation 

/ 0 -f 0 



< a \ 
b 

\c / 


: Hft 


< a 
b 


(3.106) 


which can be expressed in the form 


/ -n 

- i/42 

0 \ /a\ 


i/42 

-yti 

-if s/2 6 =0 

(3.107) 

V o 

if 42 

-4 A c) 



Note that we have represented the eigenstate by the column vector 



( id \ 

( a \ 

1 ? ft)y ^ 

AlOdriy = 

b ] 





(3.108) 


in the .S’, basis, where we have used a, b, and c for the amplitudes for notational con¬ 
venience. As we discussed in the preceding section, a nontrivial solution to (3.106) 
requires that the determinant of the coefficients in (3.107) must vanish: 


-M 

i/4 2 
0 


-1/42 

~/x 

i/4 2 


0 

-i/42 

-lx 


(3.109) 


showing that -q(q 2 — 4 + (i/42)(—in/s/l) = 0, which can be written in the 
form n(fi 2 — 1) = 0. Thus we see that the eigenvalues are indeed given by ii equals 
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1, 0, and — 1, corresponding to eigenvalues h , 0, and -H for 5 V , as expected. If 
we now, for example, substitute the eigenvalue fi = 1 into (3.106), we obtain the 
equation 


1 

V2 


(0 -i 0 \ 

/ a\ 

< a\ 

/ 0 -I 

* " 

b 

O 

O 

U/ 

) 


indicating that for this eigenstate 


—ib = \fia ia~ic=\flb and ib=s/lc 


(3.110) 


(3.111) 


From the first and last of these equations we see that e — -a. Since b = i-Jla, the 
column vector in the S z basis representing the eigenstate of S y with eigenvalue h is 
given by 


II, 1), 


S z basis 


( a \ 

i-Jla 


V -<* / 

The requirement that the state be normalized is 


(a*, -a*) 


( a 

i-Jla | = 4|u| 2 = 1 
V -a 


(3.112) 


(3.113) 


Thus, up to an overall phase, we can choose a = |, showing that 


11 , 1 ) y - 

S z basis 


1 

2 


( 1 \ 

/V2 

V -i / 


(3.114) 


or, expressed in terms of kets. 


II, l >.v = ^U, l) + /^il.0} - ~\l, -1) (3.115) 

Note that we have not put subscripts on the kets on the right-hand side of (3.115) 
because, if there is no ambiguity, we will use the convention that without subscripts 
these are understood to be eigenkets of S z . 

Based on our result, we can now ascertain how a beam of spin-1 particles exiting 
an SGy device in the state |1, l) v , that is, with S y - h, will split when it passes 
through an SGz device. The probability of the panicles exiting this SGz device 
with S z — h is given by |(1, 1|1, l) v | 2 = ||j 2 = the probability of the particles 
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p-- 

Sy=h 


SGy 

—I No 

-H 

SGz 


N 0 I4 

N 0 I2 

N 0 /4 


S z = 0 


Figure 3.10 A block diagram showing the results of the Stern- 
Gerlach experiment with spin-1 particles. 


exiting this SGz device with 5, = 0 is given by j (1, 0| 1 , l) v | 2 = | i \fl/2\ 2 = and 
the probability that the particles exit the SGz device with S z = —h is given by 
1(1, — 111, l) v | 2 = |—1| 2 = So when a beam of spin-1 “spin-up” particles from 
one SG device passes through another SG device whose inhomogeneous magnetic 
field is oriented at righl angles to that of the initial device, 25 percent of the particles 
are deflected up, 50 percent of the particles are not deflected, and 25 percent of 
the particles are deflected down (see Fig. 3.10). This is to be compared with the 
50 percent up and 50 percent down that we saw earlier for spin-^ particles in a 
similar experiment. 


EXAMPLE 3.5 Determine the fraction of spin-1 particles exiting the SGy 
device with S v = 0 that exits the SGz device in each of the three channels, 
namely with S z = h, S, = 0, and .S', = —h. 


SOLUTION Return to (3.106) and put — 0, which shows that b ■■ 
a = c. Thus, the normalized eigenstate with S v = 0 is 


: 0 and 


11 . 0 ) 


or, expressed in terms of kets. 


■'*' S z basis 




1 / 


11*o) v = —!=li' i> + 4=lii 

V2 v2 


- 1 ) 


Therefore |(1, 1|1, 0) v | 2 = |<1, —1|1, 0) v | 2 = |l/%/2| 2 = 1/2. Thus 50 per¬ 
cent of the particles exit the SGz device with S, — h and 50 percent exit with 
= -h. 

The results of this chapter may convince you that it is not easy to predict 
the results of Stern-Gerlach experiments without a detailed calculation. If 
you need more evidence, try your hand at Problem 3.22 or Problem 3.25. 
where a beam of spin- 2 particles is sent through a series of SG devices. 
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3.8 Summary 


To a physicist, angular momentum along with linear momentum and energy consti¬ 
tute the “big three” space-time dynamical variables used to describe a system. 8 An¬ 
gular momentum enters quantum mechanics in the form of three operators— J x , ,/ v , 
and J,—that generate rotations of states about the x, y, and 2 axes, respectively. 
Because finite rotations about different axes do not commute, the generators satisfy 
the commutation relations 

\J X , Jy\ = ihl [ J y ,j z \ = ihJ x J x ] = ihJ y (3.116) 

where the commutator of two operators A and B is defined by the relationship 

[A, B\ = AB — BA (3.117) 

Although the three generators J x , J y , and J z do not commute with each other, 
they each commute with 

J 2 = J 2 + + i; (3.118) 

Thus, we can find simultaneous eigenstates of J 2 and one of the components, for 
example, J z . These eigenstates are denoted by the kets | j, m) where 

J 2 |;» m) = j(j + l)h 2 \j, m) (3.119a) 

J z \j,m) = mh\j,m) (3.119b) 

Physically, we can see why J 2 and J, commute, since the eigenvalue for J 2 specifies 
the magnitude of the angular momentum for the state and the magnitude of the 
angular momentum, like the length of any vector, is not affected by a rotation. 

The linear combination of the generators 


J+ — h + ' Jy 

(3.120) 

is a raising operator: 


./+!./, rn) =s/j(j + 1) - m(m + 1 ) h\j, m + 1) 

(3.121) 

whereas ./_ = J x — i J v is a lowering operator: 


J~\ji m) = sj j{j + 1) - m(m - 1) h\j, m - 1) 

(3.122) 


8 Relativistically, we could term them the big two, grouping linear momentum and energy 
together as an energy-momentum four-vector. The importance of these variables arises primarily 
because of the conservation laws that exist for angular momentum, linear momentum, and energy, 
tn Chapter 4 we will begin to see how these conservation laws arise. Intrinsic spin angular 
momentum plays an unusually important role, which we will see when we consider systems of 
identical particles in Chapter 12. 
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Since the magnitude of the projection of the angular momentum on an axis for a state 
must be less than the magnitude of the angular momentum itself, there are limits on 
how far you can raise or lower the m values, which are sufficient to determine the 
allowed values of j and m: 

./=0, ^,1,^,2, ... (3.123) 

and for any particular j, m ranges from + j to —j in integral steps: 

m = j. j - 1, j — 2,- -j + 1, -j (3.124) 

The eigenstates of J n = J ■ n, the component of the angular momentum along an 
axis specified by the unit vector n, can be determined by setting up the eigenvalue 
equation 

J n \j, m) n = mh\j\ m) n (3.125) 

using the eigenstates of /, as a basis. Since for a particular /'. there are 2 j + 
1 different states | j, m), the eigenvalue equation (3.125) can be expressed as a 
matrix equation with the matrix representation of ./„ = J n = J x n x + J y n y + J z n z 
following directly from (3.119b), (3.120), (3.121), and (3.122). As an important 
example, the matrix representations for spin 5 are given by 

S-► -<r (3.126) 

S z basis 2 

with the Pauli spin matrices 

0) °'~C 0) a,,d a - = ('o -1) ai27> 

In (3.126) we have labeled the angular momentum operators by S instead of J, 
because when j = | we know that we are dealing with intrinsic spin. 

Finally, when two Hermitian opefators do not commute, 

[A,B] = iC (3.128) 

there is a fundamental uncertainty relation 

AAAB > ~~ (3.129) 

From this result follows uncertainty relations for angular momentum such as 

AJ x AJ y > h -\(J z )\ (3.130) 

If the z component of the angular momentum has a definite nonzero value, making 
the right-hand side of (3.130) nonzero, then we cannot specify either the x or 
y component of the angular momentum with certainty, because this would require the 
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left-hand side of (3.130) to vanish, in contradiction to the inequality. This uncertainty 
relation is, of course, built into our results (3.123) and (3.124), which, like (3.130), 
follow directly from the commutation relations (3.116). Nonetheless, uncertainty 
relations such as (3.130) bring to the fore the sharp differences between the quantum 
and the classical worlds. In Chapter 6 we will see how (3.128) and (3.129) lead to 
the famous Heisenberg uncertainty relation Ax Ap x > h/2. 

Problems 


3.1. Verify for the operators A, B. and C that 

(a) [A, B + C] = [A, B] + [A,C] 

(b) [A, BC] = B[A, C] + [A, B]C 
Similarly, you can show that 

(c) [AB, C] = A[B, C] + [A, C\B 

3.2. Using the |+z) and |— z) states of a spin-| particle as a basis, set up and solve as a 
problem in matrix mechanics the eigenvalue problem for S n = S ■ n, where the spin 
operator 8 = 5,4 + S v j + S z k and n = sin 0 cos <j>i + sin 9 sin 0j + cos 0k. Show 
that the eigenstates may be written as 

|+n) = cos — |+z) + e'^ sin z ) 

0 0 
|—n) = sin — |+z) — e lli> cos ;z) 

Rather than simply verifying that these are eigenstates by substituting into the 
eigenvalue equation, obtain these states by directly solving the eigenvalue problem, 
as in Section 3.6. 

3.3. Show that the Pauli spin matrices satisfy + o J a l = 2<5,- ; I, where / and j 
can take on the values 1, 2, and 3, with the understanding that a l = a x , a 2 = cr y , 
and er 3 = cr 2 . Thus for i = j show that ct ? 2 = ct 2 = rr 2 = I, while for i j show 
that {a h ctj ) =0, where the curly brackets are called an anticomniutator, which 
is defined by the relationship {A, B] = AB + BA. 

3.4. Verify that (a) a x a — 2 ia and (b) a ■ a a ■ b = a • b I + ia ■ (a x b), where 
a = a x i + o y j + or z k. 

3.5. This problem demonstrates another way (also see Problem 3.2) to determine 
the eigenstates of S„ = S • n. The operator 

R(6 j) = e ~ iS y e/h 

rotates spin states by an angle 6 counterclockwise about the y axis. 
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(a) Show that this rotation operator can be expressed in the form 

e 2i - . e 

R(6 1 ) — cos- S v sin - 

J 2 ft > 2 

Suggestion: Use the states |+z) and |— z) as a basis. Express the operator 
R(0j) in matrix form by expanding R in a Taylor series. Examine the explicit 
form for the matrices representing S'*, S*, and so on. 

(b) Apply R in matrix form to the state |+z) to obtain the state |+n) given in 
Problem 3.2 with 0 = 0, that is, rotated by angle 9 in the x-z plane. Show that 
R |—z) differs from |—n) by an overall phase. 

3.6. Derive (3.60). 

3.7. Derive the Schwarz inequality 

(a\a){W) >\(<xm 2 

Suggestion: Use the fact that 

((ff|+r0|)(|a> + U J S))>O 

and determine the value of A that minimizes the left-hand side of the equation. 

3.8. Show that the operator C defined through (A, B] = i C is Hermitian, provided 
the operators A and B are Hermitian. 

3.9. Calculate A S x and A S v for an eigenstate of S z for a spin-4 particle. Check to see 
if the uncertainty relation AS x AS y > h\(S z )\/2 is satisfied. Repeat your calculation 
for an eigenstate of S x . 

3.10. Use the matrix representations of the spin-| angular momentum operators S x , 
S y , and S z in the .S', basis to verify explicitly through matrix multiplication that 

[S x , S y ] = iHS z 

3.11. Determine the matrix representations of the spin-^ angular momentum opera¬ 
tors S x . S y , and S z using the eigenstates of S y as a basis. 

3.12. Verify for a spin- ' particle that (a) 

S z = (h/2 )|+z>(+z| - (fi/2)|-z)(-z| 

and (b) the raising and lowering operators may be expressed as 

S + = h\+z){- z| and S_ = fi|—z)(+z| 

Note: It is sufficient to examine the action of these operators on the basis states |+z) 
and |— z), which of course form a complete set. 
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3.13. Repeat Problem 3.10 using the matrix representations (3.28) for a spin-1 
particle in the /, basis. 

3.14. Use the spin-1 states jl, 1), jl, 0), and 11, — 1) as a basis to form the matrix 
representations of the angular momentum operators and hence verify that the matrix 
representations (3.28) are correct. 

3.15. Determine the eigenstates of S x for a spin-1 particle in terms of the eigenstates 
|1, 1), |1, 0>, and il, -l)ofS z . 

3.16. A spin-1 particle exits an SGz device in a state with S z = h. The beam then 
enters an SGx device. What is the probability that the measurement of S x yields the 
value 0? 


3.17. A spin-1 particle is in the state 


I f) 


S z basis Vu 



(a) What are the probabilities that a measurement of S z will yield the values h, 0, 
or -h for this state? What is {A,}? 

(b) What is (S x ) for this state? Suggestion: Use matrix mechanics to evaluate the 
expectation value. 

(c) What is the probability that a measurement of S x will yield the value h for 
this state? 


3.18. Determine the eigenstates of S„ = S ■ n for a spin-1 particle, where the spin 
operator S = .S’ c i + 5 v j + S,k and n = sin 9 cos cp i + sin 9 sin <p j + cos (p k. Use the 
matrix representation of the rotation operator in Problem 3.19 to check your result 
when <p = 0. 


3.19. Find the state with S n — h of a spin-1 particle, where n = sin 9 i + cos 9 k, 
by rotating a state with S z = h by angle 9 counterclockwise about the y axis using 
the rotation operator R(9j) = e lS y®9\ Suggestion: Use the matrix representation 
(3.104) for S y in the S, basis and expand the rotation operator in a Taylor series. 
Work out the matrices through the one representing .S' 3 in order to see the pattern 
and show that 


%j) 


S z basis 


i + COS 0 

~2 
sin 9 




1 


s/2 

~ cos 9 


sin 9 

cos 9 
sin 9 


■ cos 0 


2 

sin 9 


s/2 

1 + cos 9 


) 


3.20. A beam of spin-1 particles is sent through a series of three Stem Gerlach 
measuring devices (Fig. 3.11). The first SGz device transmits particles with S z = h 
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Figure 3.11 A Stern-Gerlach experiment with spin-1 particles. 


and filters out particles with S, = 0 and S z = —h. The second device, an SGn device, 
transmits particles with S n — h and filters out particles with S n = 0 and S n = — h, 
where the axis n makes an angle 0 in the x-z plane with respect to the z axis. A last 
SGz device transmits particles with S, = —h and filters out particles with S, = h and 
S, = 0. 

(a) What fraction of the particles transmitted by the first SGz device will survive 
the third measurement? Note: The states with S„ = h, S n = 0, and S n = —h 
in the S z basis follow directly from applying the rotation operator given in 
Problem 3.19 to states with S z = h, S z = 0, and S z = —h, respectively. 

(b) How must the angle 0 of the SGn device be oriented so as to maximize the 
number of particles that are transmitted by the final SGz device? What fraction 
of the particles survive the third measurement for this value of 01 

(c) What fraction of the particles survive the last measurement if the SGn device 
is removed from the experiment? 

Repeat your calculation for pails (a), (b), and (c) if the last SGz device transmits 
particles with S z = 0 only. 

3.21. Introduce an angle 9 defined by the relation cos (9 = 7 Z /|J|, reflecting the 
degree to which a particle’s angular momentum lines up along the z axis. What 
is the smallest value of 0 for (a) a spin-| particle, (b) a spin-1 particle, and (c) a 
macroscopic spinning top? 

f/ 

3.22. Arsenic atoms in the ground state are spin- 4 particles. A beam of arsenic atoms 
enters an SGx device, a Stern Gcrlach device with its inhomogeneous magnetic field 
oriented in the x direction. Atoms with S x = then enter an SGz device. Determine 
the fraction of the atoms that exit the SGz device with S z = | fi, S z = \h, S z = 

and S z = — f ft. 

3.23. For a spin-4 particle the matrix representation of the operator S x in the S. 
basis is given by 


( 0 

V3 

0 

0 ^ 

V3 

0 

2 

0 

0 

2 

0 

-s/3 

l 0 

0 

x/3 

o ) 
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Pick one of the following states and verify that it is an eigenstate of S x with the 
appropriate eigenvalue: 



( 1 ^ 


/ 43 \ 

3 , 1 

2 ’ 2)j: 2\/2 

43 

i 3 1 \ 1 

1 

43 

l5 ' j) '^2V2 

-1 


11 ) 


1-43/ 



/43? 


/ 1 \ 

3 1) 1 

2 ’ 2X 242 

-1 

|3 3^ 1 

2 ' 2 - r 242 

-43 

-1 

43 


\V3) 


V -1 / 


Do you notice any property of these representations that is at least consistent with 
the other states being correct? 

3.24. A spin-4 particle is in the state 

/ / \ 

2 

| t/r) ->• N 

S z basis 3 

(a) Determine a value for N so that |i//) is appropriately normalized. 

(b) What is (S x ) for this state? Suggestion: The matrix representation of S x is 
given in Example 3.4. 

(c) What is the probability that a measurement of S x will yield the value h/2 for 
this state? Suggestion: See Problem 3.23. 

3.25. 

(a) Determine the matrix representation for S y for a spin-f particle. 

(b) Determine the normalized eigenstate of S y with eigenvalue jh. 

(c) As noted in Problem 3.22, arsenic atoms in the ground state are spin-| 
particles. A beam of arsenic atoms with S y = enters an SGz device. 
Determine the fraction of the atoms that exit the SGz device with S z = \h , 
S, = |ft, S z = — \h, and S z = — | H. 

3.26. Show that if the two Hermitian operators A and B have a complete set of 
eigenstates in common, the operators commute. 

3.27. Show that 

e A+B^ e A e B 

unless the operators A and B commute. Problem 7.19 shows what happens if A and 
B do not commute but each commutes with their commutator [A, B |. 
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Time Evolution 


Most of the interesting questions in physics, as in life, concern how things change 
with time. Just as we have introduced angular momentum operators to generate 
rotations, we will introduce an operator called the Hamiltonian to generate time 
translations of our quantum systems. After obtaining the fundamental equation of 
motion in quantum mechanics, the Schrodinger equation, we will examine the time 
evolution of a number of two-state systems, including spin precession and mag¬ 
netic resonance of a spin-^ particle in an external magnetic field and the ammonia 
molecule. 


4.1 The Hamiltonian and the Schrodinger Equation 


We begin our discussion of time development in quantum mechanics with the time- 
evolution operator (7(f) that translates a ket vector forward in time: 

U(t)\nO)) = \if(t)) ( 4 . 1 ) 

where |t/r(0)) is the initial state of the system at time t = 0 and |t/r(f)) is the state 
of the system at time f. In order to conserve probability, 1 time evolution should not 
affect the normalization of the state: 

Momo) = ormuHt)u(t)\no)) = wmmo)) = 1 (4.2) 


1 In most applications of nonrelativistic quantum mechanics, the total probability of finding 
the particle doesn't vary in time. However, an electron could disappear, for example, by meeting 
up with its antipartiele, the positron, and being annihilated. Processes such as particle creation and 
annihilation require relativistic quantum field theory for their description. 


in 
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which requires 


&Ht)U(t) = 1 (4.3) 

Thus the time-evolution operator must be unitary. 

Just as we introduced the generator of rotations in (2.29) by considering an 
infinitesimal rotation, here we consider an infinitesimal time translation: 

U(dt)= l- ~H dt (4.4) 

h 

where the operator H is the generator of time translations. Clearly, we need an 
operator in order to change the initial ket into a different ket at a later time. This is 
the role played by H. Unitarity of the time-evolution operator dictates that H is a 
Hermitian operator (see Problem 4.1). 

We can now show that U satisfies a first-order differential equation in time. Since 

1 (4.5) 

then 

U(t + dt) - U(t) = dt'j UU) (4.6) 

indicating that the time-evolution operator satisfies 2 

ih^-U = HU {t) (4.7) 

dt 

We can also apply the operator equation (4.6) to the initial state \f(0)) to obtain 

ih^-lirit)) = H\f(t)) (4.8) 

dt 

This equation, known as the Schrodinger equation, is the fundamental equation of 
motion that determines how states evolve in time in quantum mechanics. Schrodinger 
first proposed the equation in 1926, although not as an equation involving ket vectors 
but rather as a wave equation that follows from the position-space representation of 
(4.8), as we will see in Chapter 6. 

If H is time independent, we can obtain a closed-form expression for 0 from a 
series of infinitesimal time translations: 


U(t + dt) = U(dt)U(t) = 


2 The derivative of an operator is defined in the usual way, that is, 

dU = U(t + At)-U(t) 
dt Ar-*o A t 
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r • . v-r.-V 

0(t) = lim 1-// ( — ) = e- ,tu/h (4.9) 

h \nJ\ 

where we have taken advantage of Problem 2.1. Then 

\if(t)) = e illt/n \f(0)) (4.10) 

Thus in order to solve the equation of motion in quantum mechanics when H is time 
independent, all we need is to know the initial state of the system |i/r(0)) and to be 
able to work out the action of the operator (4.9) on this state. 

What is the physical significance of the operator HI Like the generator of 
rotations, H is a Hermitian operator. From (4.4) we see that the dimensions of H 
are those of Planck’s constant divided by time—namely, energy. In addition, when 
H itself is time independent, the expectation value of the observable to which the 
operator H corresponds is also independent of time: 

{xlr(t)\H\f(t)) = (ir(0)\UHt)HU(t)\ir(0)) = (Q)\H \f(0)) (4.11) 

since H commutes with U? All of these things suggest that we identify H as the 
energy operator, known as the Hamiltonian. Therefore 

(E) = {f\H\i]r) (4.12) 

The eigenstates of the Hamiltonian, which are the energy eigenstates satisfying 

H\E) = E\E) (4.13) 

play a special role in quantum mechanics. The action of the time-evolution operator 
U (t) on these states is easy to determine using the Taylor series for the exponential: 

I E) 

\E) =e~ iEt/K \E) (4.14) 

The operator H in the exponent can simply be replaced by the energy eigenvalue 
when the time-evolution operator acts on an eigenstate of the Hamiltonian. Thus if 
the initial state of the system is an energy eigenstate, |y/(0)} = | E), then 

| f{t)) = e~ iA,/n \E) = e ~ iEtlh \E) (4.15) 

3 To establish that H commutes with U , use the Taylor-series expansion for U, as in (4.14). 
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The state just picks up an overall phase as time progresses; thus, the physical state 
of the system does not change with time. We often call such an energy eigenstate a 
stationary state to emphasize this lack of time dependence. 

You might worry that physics could turn out to be boring with a lot of empha¬ 
sis on stationary states. However, if the initial state | i/r(0)> is a superposition of 
energy eigenstates with different energies, the relative phases between these en¬ 
ergy eigenstates will change with time. Such a state is not a stationary state and the 
time-evolution operator will generate interesting time behavior. All we need to do 
to determine this time dependence is to express this initial state as a superposition 
of energy eigenstates, since we now know the action of the time-evolution operator 
on each of these states. We will see examples in Sections 4.3 and 4.5. 

4.2 Time Dependence of Expectation Values 


The Schrodinger equation permits us to determine in general which variables exhibit 
time dependence for their expectation values. If we consider an observable A , then 

~r{A) = ^-(f(t)\A\f(t)) 
dt dt 


dt 


dt 


(f(t )\) ( ~\ifr(t)) ) + (V' , (0l“l^(0) 


dt 


1 


—ih 


1 


-mtw A\f(t )> + w(o\a —h\ f{t)) + mo\—mt)) 


ih 


BA, 


dt 


i „ ai 

= ~{f(t)\W, A]\f(t)) + (f(t)\—~ IlKO) 
n dt 


(4.16) 


The appearance of the last term involving dA/dt in this equation allows for the 
possibility that the operator depends explicitly on time. Equation (4.16) shows that 
provided the operator corresponding to a variable does not have any explicit time 
dependence (dA/dt = 0). the expectation value of that variable will be a constant 
of the motion whenever the operator commutes with the Hamiltonian. 

What do we mean by explicit time dependence in the operator? Our examples 
in Sections 4.3 and 4.4 will probably illustrate this best. The Hamiltonian for a 
spin-j particle in a constant magnetic field is given in (4.17). There is no explicit 
t dependence in //; therefore substituting H for the operator A in (4.16) indicates 
that energy is conserved, since H of course commutes with itself. However, if we 
examine the Hamiltonian (4.34) for a spin -1 particle in a time-dependent magnetic 
field, we see explicit time dependence within the Hamiltonian in the factor cos cot. 
Such a Hamiltonian does not lead to an expectation value for the energy of the spin 
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system that is independent of time because dH/dt — 0. There is clearly an external 
system that is pumping electromagnetic energy into and out of the spin system. 

4.3 Precession of a Spin -5 Particle in a Magnetic Field 


As our first example of quantum dynamics, let’s consider the time evolution of the 
spin state of a spin- 5 particle in a constant magnetic field. We will choose the z axis 
to be in the direction of the magnetic field, B = B 0 k, and take the charge of the spin- 
i particle to be q = -e, that is, to have the same charge as an electron. The energy 
operator, or Hamiltonian, is given by 

H = —ji ■ B = —-^-S • B = -^—S z Bq = « 0 5 z < 4 - 17 ) 

2m c 2 me 

where we have used (1.3) to relate the magnetic moment operator fi and the intrinsic 
spin operator S. We have also defined loq = geBg/2mc. The eigenstates ol H are the 
eigenstates of S z : 

H |+z) = co 0 S z \+z) = —r -|+z) = E + |+z) (4.18a) 

H | - z) = a> 0 S z | -z) = - ^ | ■-z) = E_ | - 2 ) (4.18b) 

where we have denoted the energy eigenvalues of the spin-up and spin-down states 
by E + and E_, respectively. 

What happens as time progresses? Since the Hamiltonian is time independent, 
we can take advantage of (4.9): 

U(t) = = e ~ iw °^ ,,h = e ,s ^ /h = R(4> k) (4.19) 

where in the last two steps we have expressed the time-development operator as the 
rotation operator that rotates states about the z axis by angle <p = a> 0 t. Thus we see 
that placing the particle in a magnetic field in the z direction rotates the spin of the 
particle about the z axis as time progresses, with a period T = 2jr/o> 0 . Using the 
terminology of classical physics, we say that the particle’s spin is precessing about 
the z axis, as depicted in Fig. 4.1. However, we should be careful not to carry over 
too completely the classical picture of a magnetic moment precessing in a magnetic 
field since in the quantum system the angular momentum—and hence the magnetic 
moment—of the particle cannot actually be pointing in a specific direction because 
of the uncertainty relations such as (3.75). 

In order to see how we work out the details of quantum dynamics, let s take 
a specific example. With B = £ 0 k, we choose \\fr(0)) = |+x). The state |+x) is a 


Page 131 (metric system) 



116 | 4. Time Evolution 


z 



Figure 4.1 A spin-^ particle, initially in the state |+x), 
precesses about the magnetic field, which points in the z 
direction. 


superposition of eigenstates of S z , and therefore from (4.18) it is a superposition of 
energy eigenstates with different energies. The state at time t is given by 


| ^(0) =*-«■"'/*( 
e ~iE + t/h 

= —JT~ 


Tf l+Z> + Ti 


|+z) + 


-iE_t/h 



e ~ico 0 l/2 


1 + 2 ) + 


giiOQt/2 


l-z) 


(4.20) 


This state does not simply pick up an overall phase as time progresses; it is not a 
stationary state. Equation (4.20) can also be written as 


giMQt 


which is just an overall phase factor times the spin-up state |4-n) that we found in 
(3.98), provided we choose the azimuthal angle <p = co {) t. 

Let’s investigate how the probabilities of being in various spin states and the spin 
expectation values evolve in time. We use the expression (4.20) for | xfr(t)). Note that 


-z> 


(4.21) 


| xj,(t))=e~ iw ° t ' 2 ( 4=1+2) 
, V 2 


\{+z\ir{t))\ 2 


e -i<Dtfll 2 

~7T 


1 

2 


l( — z|l/r(f )>| 2 


e iio 0 t/2 2 


2 

2 


are independent of time, and therefore 


(S z ) = 


1 

2 




(4.22a) 

(4.22b) 


(4.23) 


is also a constant of the motion. 

When we examine the components of the intrinsic spin in the x-y plane, we do 
see explicit time dependence. Since 
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(+X| t/r(f)} 


= (7! <+zl + 7! 1 - 21 ) 

1 1 f e - iw ° t/2 \ 

~ V2 {1 ^ y/l V ) 


e -iu> 0 t/2 

~7T 


l+z) + 



e im 0 t/2 

“7T 



(4.24) 


where in the second line we have used the matrix representations for the states in the 
S z basis, then 


|<+x|^(/))| 2 = cos 2 ^ 


(4.25) 


As a check, note that the probability of the particle being spin up along the x axis is 
one at time t = 0, as required by the initial condition. Similarly, 


<-x|t/r( 0 > = 




V2 


iaiQl/2 

|+Z) + —— |-z) 


e -ito 0 t/2 £ 


V2 


V2 (1 ’ ^ V2 l e iu) ot ft 


e -ia> 0 t/2\ _ ftW 

) = — i sin- 

/ 2 


(4.26) 


and 


I{—x|i/r(/)>| 2 = sin 2 (4.27) 

The sum of the probabilities to be spin up or spin down along x is one for all 
times, since these two states |+x) and |-x> form a complete set and probability 
is conserved. We can determine the average value of S x either as the sum of the 
eigenvalues multiplied by the probabilities of obtaining each of these eigenvalues, 


(S x ) = cos 


2 


ft Vsin 2 ^ 


( n\ n 
\ 2 / = 2 C ° S Ct) ° ? 


(4.28a) 


or from 


(s x ) = mo\s x mt)) 

“ V2 


(** 


o>ot/2 e ~ia> 0 t/2 



h 

= — COS U) 0 t 


(4.28b) 


where we have used the representation lor the bra and the ket vectors and the operator 
in the S. basis. 
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A similar calculation yields 

l(+yWO)l 2 = ' + ”" l>l0> (4.29a) 

l(-yl^(())| 2 = i— -y- — (4.29b) 

and 

(S y ) = | sin co 0 t (4.30) 

All of these results are consistent with the spin precessing counterclockwise around 
the z axis with a period T = Iji/coq, in agreement with our analysis using the explicit 
form (4.19) of the time-evolution operator as a rotation operator. If the charge q of 
the particle is taken to be positive rather than negative, <x> 0 is negative, and the spin 
precesses in a clockwise direction. 

Before going on to examine some examples of spin precession, it is worthwhile 
commenting on the time dependence of the expectation values (4.23), (4.28), and 
(4.30). First, note from (4.16) that 

^-(S z ) = Uf\[H,SM) (4.31) 

at n 

We can see from the explicit form of the Hamiltonian (4.17), which is just a constant 
multiple of S z , that H commutes with S, and therefore (S z ) is time independent 
[as (4.23) shows]. It is interesting to consider this result from the perspective of 
rotational invariance. In particular, with the external magnetic field in the z direction, 
rotations about the z axis leave the spin Hamiltonian unchanged. Thus the generator 
S z of these rotations must commute with H, and consequently from (4.31) (S z ) is a 
constant of the motion. The advantage of thinking in terms of symmetry' (a symmetry 
operation is one that leaves the system invariant) is that we can use symmetry to 
determine the constants of the motion before we actually carry out the calculations. 
We can also know in advance that ( S x ) and ( S v } should vary with time. After all, 
since S x and S y generate rotations about the x and y axes, respectively, and the 
Hamiltonian is not invariant under rotations about these axes, H does not commute 
with these generators. 

EXAMPLE 4.1 Verify that the expectation values (4.28) and (4.30) satisfy 

^-{S x )^j(xlf\[H,S x U) 
dt h 

SOLUTION Since 

| H = coqS. 


Page 134 (metric system) 



4.3 Precession of a Spin-i Particle in a Magnetic Field I 119 


we want to see if 


- 7 ~{S X ) =^(xfr\[co 0 S z , S x ]\f) - ~co 0 (S y ) 
at h 


where we have used 

[L S x ] = ihSy 

one of the fundamental commutation relations of the angular momentum 
operators. Substituting in the expectation values (4.28) and (4.30), we see 
that indeed 


dt 


( S x ) = 


d_ fh 
dt \2 

Hu) 0 


cos co 0 t 
sin (W()f 


= ~CO 0 (S y ) 


THE g FACTOR OF THE MUON 

An interesting application of spin precession is the determination of the g factor of 
the muon. The pion is a spin-0 particle that decays into a muon and a neutrino. The 
primary decay mode, for example, of the positively charged pion is n + ix + + i ),. 
where the subscript on the neutrino indicates that it is a type of neutrino associated 
with the muon. Unlike photons, which are both right- and left-circularly polarized, 
neutrinos are essentially left handed . 4 For a spin-4 particle like the neutrino this 
means that the projection of the angular momentum along the direction of motion 
of the neutrino is only —h/2. There is no +h/2 projection. Conservation of angular 
momentum in the decay of a pion at rest requires that the muon produced in this 
decay, which is also a spin-j particle, Jbe left handed as well (see Fig. 4.2). The muon 
is unstable and decays via q + -* e + + v e + v M , with a lifetime of approximately 2.2 
microseconds in the muon’s rest frame. As a consequence of the weak interactions 
responsible for the decay, the positron is preferentially emitted in a direction opposite 
to the spin direction of the muon, and therefore monitoring the decay of the muon 
gives us information about its spin orientation. If the muon is brought to rest, say 
in graphite, and placed in a magnetic field of magnitude Bq along the z direction 
with the initial spin state spin up along the x axis as in our earlier discussion, the 
spin of the muon will precess. A detector located along the x axis to detect the 
positrons that are produced in the decay should yield a counting rate proportional 


4 The existence of neutrino oscillations indicates that neutrinos have a very small mass. If the 
neutrino mass were exactly zero, neutrinos would be purely left handed. 
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Figure 4.2 (a) Conservation of linear and angular 
momentum requires that the decay of the spin-0 pion in 
its rest frame produces a left-handed /j, + , since the is 
essentially a left-handed particle, (b) The is brought 
to rest with its spin up along the x axis and allowed 
to precess in a magnetic field in the z direction. The 
positrons from the [i + decay are emitted preferentially 
in the opposite direction to the spin of the ft + , 


to (4.25) as the muon’s spin precesses in the magnetic field. Figure 4.3 shows the 
data from a typical experiment that we can use to obtain a value for the g factor (see 
Problem 4.7). The first measurements of this sort were carried out by Garwin et al., 5 
who found g = 2.00 ± 0.10. The best experimental value for g — 2 of the muon, 
good to six significant figures, comes from a spin-precession experiment carried out 
at Brookhaven National Laboratory. 6 There is much interest in measuring the g factor 
of the muon because its accurate determination can provide information about the 
strong and electro-weak interactions at short distances, as well as a detailed test of 
quantum electrodynamics. 

2n ROTATIONS OF A SPIN-| PARTICLE 

As a second illustration of spin precession, let’s consider a beautiful experiment that 
demonstrates that rotating a spin-4 particle through 2n radians causes the state of 
the particle to change sign, as shown in (2.43). At first thought, it might not seem 
feasible to test this prediction since the state of the particle picks up an overall phase 
as the result of such a rotation. However, as we saw in our discussion of Experiment 4 
in Chapter 1, a single particle can have amplitudes to take two separate paths and 
how these amplitudes add up, or interfere, depends on their relative phases. Werner 
et al. 7 used neutrons as the spin-^ particles and constructed an interferometer of the 


5 R. L. Garwin, L. M. Lederman, and M. Weinrich, Phys. Rev. 105, 1415 (1957). 

6 This measurement [G. W. Bennett et al., Phys. Rev. Lett. 92, 16 1 8102 (2004)] takes advantage 
of the fact that the difference between the frequency at which the muon circles in a constant 
magnetic field (its cyclotron frequency) and the frequency of spin precession for a muon initially 
polarized parallel or antiparallel to its direction of motion is proportional to g — 2. 

1 S. A. Werner. R. Colella, A. W. Overhauser. and C. F. Eagen, Phys. Rev. Lett. 35. 1053 (1975). 
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Precession frequency = 807.5 kHz 



Figure 4.3 Data on the precession of a muon 
in a magnetic field of magnitude 60 gauss. 
Adapted from J. Sandweiss et al., Phys. Rev. 
Lett. 3 0, 1002(1973). 



?C 3 

5 c 2 



(a) 

Figure 4.4 (a) A schematic diagram of the neutron interferometer and (b) the difference 
in counts between the counters C 3 and C 2 as a function of the magnetic field strength. 
Adapted from Werner et al., Phys. Rev. Lett. 35, 1053 (1975). 


type first developed for X-rays. Their schematic of the interferometer is shown in 
Fig. 4.4a. A monoenergetic beam of thermal neutrons is split by Bragg reflection 
from a crystal of silicon into two beams at A, one of which traverses path ABD and 
the other path ACD. A silicon crystal is used to deflect the beams at B and C, as well 
as to recombine them at D. As in a typical interferometer, there will be constructive 
or destructive interference depending on the path difference between the two legs 
ABD and ACD. The relative phase of the two beams can be altered, however, by 
allowing one of the beams to pass through a uniform magnetic field. As indicated 
by (4.21), there will be an additional phase difference of 

^ = = (4.32) 

2 Me 

introduced, where M is the mass of the nucleon, B 0 is the magnitude of the uniform 
field on the path AC, and T is the amount of time the beam spends in the magnetic 
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field . 8 In the experiment, the magnitude of the magnetic field strength could be varied 
between 0 and 500 gauss. The difference in B 0 , which we call A B, needed to produce 
successive maxima is determined by the requirement that 


geAB 
2 Me 


T = 4n 


(4.33) 


Notice that we have used the fact that a rotation by 47 r radians is required to return the 
overall phase of spin-y ket to its original value. As shown in Fig. 4.4b, Werner et al. 
found AB = 62 ± 2 gauss in their experiment. If rotating a ket by 2 n radians were 
sufficient to keep the phase of the ket the same, the observed value of A B would have 
been one half as large as that found in the experiment. Thus the experimental results 
give an unambiguous confirmation of the unusual prediction (2.43) of quantum 
mechanics for spin- 5 particles. 


EXAMPLE 4.2 The Hamiltonian for a spin -5 particle in a magnetic field 
B = S 0 i is given by 

H = o 0 S x 

where a > 0 = geB 0 j2mc. If initially the particle is in the state 

llK0)> = l+z> 

determine |t//(f)), the state of the particle at time t. 

SOLUTION The time development operator is given by 

0 (t) = e— g-ietoSjct/fi 

Since 

e . iirJ()S : l/h _ e ~iS x 4>/h __ 

where <j> = oo 0 t, the Hamiltonian causes the spin to rotate, or precess, about 
the x axis in this case. In order to work out the action of the time development 


8 Three comments about this expression are in order. (1) Since a neutron is a neutral particle, it 
might seem strange for it to have a magnetic moment at all. That g/2 = —1.91 is an indication that 
the neutron is not itself a fundamental particle, but rather is composed of more fundamental charged 
constituents called quarks. (2) In nuclear physics, magnetic moments arc generally expressed in 
terms of the nuclear magneton where the mass M in (4.32) is really the mass of the proton. Since 
the mass of the proton differs from the mass of the neutron by less than 0.2 percent, we can ignore 
this distinction unless we are interested in results to this accuracy. (3) The time T can be expressed 
as T = IMIp, where p is the momentum of the neutron and I is the path length in the magnetic 
field region. We can then use the de Broglie relation p = h/X [see (6.56)1 to express this time in 
terms of the wavelength of the neutron. It is actually X that is determined when selecting the energy 
of the neutron beam using the techniques of crystal diffraction. 
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operator on the state |^(0)), we need to express \f(0)) in terms of the 
eigenstates |+x) and |-x) of the Hamiltonian. Note that 

IVr(0)) = l+z> 

= |+x>(+x|+z> + |-x)(-x|+z) 

= 7! i+s, + 7i | -’ ,) 

I 

Thus 




h 

= - COS COr.t 
2 

This is the same result that we obtained for { S x > in (4.28). After all, 
although in this example the magnetic field pointed in the x direction and 
the particle's state was initially spin up along the 2 axis, you could have 
chosen to label these axes the 2 and x axes, respectively, making this example 
problem exactly the same as the example worked out at the beginning of 
this section. The main reason for including this example problem here is to 
emphasize the strategy for working out time dependence when the initial 
state is not an eigenstate of the Hamiltonian, namely, write the initial state 
as a superposition of the eigenstates of the Hamiltonian and then apply the 
time development operator to this superposition. 
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4.4 Magnetic Resonance 


When a spin-1 particle precesses in a magnetic field in the z direction, the probability 
of the particle being spin up or spin down along z doesn’t vary with time, as 
shown in (4.22). After all. the states |+z) and j—z) are stationary states of the 
Hamiltonian (4.17). However, if we alter the Hamiltonian by applying in addition an 
oscillating magnetic field transverse to the z axis, we can induce transitions between 
these two states by properly adjusting the frequency of this transverse field. The 
energy difference E + — E = fuo 0 can then be measured with high accuracy. This 
magnetic resonance gives us an excellent way of determining a> 0 . Initially, physicists 
used magnetic resonance techniques to make accurate determinations of g factors 
and thus gain fundamental information about the nature of these particles. On the 
other hand, with known values for g, one can use the technique to make accurate 
determinations of the magnetic field B {) in which the spin is precessing. For electrons 
or nuclei in atoms or molecules, this magnetic field is a combination of the known 
externally applied field and the local magnetic field at the site of the electron or 
nucleus. This local field provides valuable information about the nature of the bonds 
that electrons in the atom make with neighboring atoms in a molecule. More recently, 
magnetic resonance imaging (MRI) has become an important diagnostic tool in 
medicine. 

The spin Hamiltonian for magnetic resonance is given by 

H = —/t • B = -M-S ■ B = -M -s . cos cot i + B 0 k) (4.34) 
2 me 2mc 

where the magnetic field includes a constant magnetic field in the z direction and an 
oscillating magnetic field in the x direction. As we did for spin precession, we choose 
q — —e and set egB () /2mc — co 0 . We also define egB l /2mc — oj j . The Hamiltonian 
can now be wnitten as 


H = (o 0 S : + tupcos Mi ).S\ (4.35) 

This Hamiltonian is time dependent, so we cannot use the expression (4.9) for the 
time-evolution operator. 9 

I’o determine how spin slates evolve in time, we return to the Schrddinger 
equation (4.8). Let’s take the state of the particle at time t = 0 to be |+z). We will 
work in the S, basis and express i//(; )) in this basis by 


9 If we were to choose our total system to be sufficiently large, including, in this example, the 
energy of both the spin system and the electromagnetic field, we would find that the total energy 
is conserved. Here we are treating the magnetic field as an external field acting on the quantum 
spin system. 
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a ( ait) \ 

* I fit)) -+■ ) (4.36) 

V b(t) ) 

with the initial condition 

(4-37) 

In this basis, the time-development equation H\tf/(t)) = ihd\ir(t))Jdt is given by 

fi / <% o)\ cos tvt \ / a(t) \ / a(f) 

2 \ cos cot —cu 0 / \ b(t) ) \ bit) 

where a(t) = da/dt and bit) = db/dt. This coupled set of first-order differential 
equations cannot be solved exactly. In practice, however, the transverse field B i is 
significantly weaker than the field B 0 in the z direction and therefore the frequency 
o)j is considerably smaller than &> 0 . We can take advantage of this fact to obtain an 
approximate solution to (4.38). 

First, note that if co x — 0, the solution to (4.38) is 

a(t) = a{ 0)e ia) ° ,/2 and b(t)=b( 0)e iWQ,/2 (4.39) 


(4.38) 


in agreement with the time dependence of our earlier results (4.20). This suggests 
that we try writing 

(“">) = (4.40) 

\b(t)J \ d(t)e“°o t / 2 ) 

where we expect that we have included the major part of the time dependence in the 
exponentials. If we substitute (4.40) into (4.38), we obtain 




OJ, 

— cos cot 


/ d(t)e im o' \ 

\c(t)e^'“ l ) , / 


to, / (e'^o+‘^ + d{t) 

4 \ _|_ e -i(a>Q+w)t^ 


(4.41) 


Unless to is chosen to be very near to co 0 , both the exponentials in the second line 
of (4.41) are rapidly oscillating functions that when multiplied by a more slowly 
oscillating function such as c(t ) or d(t), whose time scale is set by co h will cause 
the right-hand side of (4.41) to average to zero . 10 However, if to is near to 0 , the tenns 
oscillating at to 0 + co can be neglected with respect to those oscillating at co 0 — to, 
and these latter terms are now oscillating sufficiently slowly that c and d vary with 


10 In a typical electron spin resonance (ESR) experiment in a field of 10 4 gauss, w 0 ~ 10 11 Hz, 
while for nuclear magnetic resonance (NMR) with protons in a comparable field, a> 0 ~ 10 8 Hz. 


Page 141 (metric system) 



126 | 4. Time Evolution 


time. Here we will solve for this time dependence when ca is equal to a> 0 , the resonant 
condition, and leave the more general case as a problem. 

Setting to = mq and neglecting the exponentials oscillating at 2oj 0 , we obtain 


( c(0\ _ coifd(t)\ 

\m) ~ 4 Vc(0/ 


(4.42) 


If we take the time derivative of these two coupled equations and then use (4.42) to 
eliminate the terms involving a single derivative, we obtain the uncoupled second- 
order differential equations 



The solution to (4.43) satisfying the initial condition c(0) = 1 and d( 0) = 0 [see 
(4.42)] is c(t) = cos(co l t/4) and d(t) = —i sin(e t > 1 r/4). Thus the probability of find¬ 
ing the particle in the state |—z) at time t is given by 

\(-z\f(t))\ 2 = b*(t)b(t) = d*(t)d(t) = sm 2 ^ (4.44a) 

4 

for a spin-4 particle that initially resides in the state j+z) at t = 0. Similarly, the 
probability of finding the panicle in the state |+z) is given by 

| (—z| | 2 = a*(t)a(t) = c*(t)c{t) = cos 2 ^ (4.44b) 

4 

Of course, these two probabilities sum to one, since these two states form a complete 
set and probability is conserved in time. If a particle initially in the state |+z) 
makes a transition to the state |—z), the energy of the spin system is reduced by 
E + — = hojty assuming to 0 > 0. This energy is added to the electromagnetic 

energy of the oscillating field that is stimulating the transition. For t between zero 
and 2 n/to { , the probability of making a transition to the lower energy state grows 
until b*(t)b(t) = 1 and a*(t)a{t) — 0. Then the particle is in the state |— z). Next 
for t between 2n/w\ and 4jr/oj|. the probability of being in the lower energy state 
decreases and the probability of being in the higher energy state grows as the system 
absorbs energy back from the electromagnetic field. This cycle of emission and 
absorption continues indefinitely (see Fig. 4.5). 

As noted earlier, there is a probability of inducing a transition between the two 
spin states even when the frequency to is not equal to a> 0 . If the system is initially 
in the spin-up state, the probability of being in the lower energy spin-down state at 
time t is given by Rabi’s formula (see Problem 4.9), 




arJ4 . J (a>o - ") 2 + <»?/ 4 

—4--— sin" 2 - 1 


(co o - to) 2 + to]/4 


(4.45) 
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Figure 4.5 The probabilities \{+z\ifr (f))| 2 (solid 
line) and \{~z\tfr(t))\ 2 (dashed line) for a spin-i 
particle that is in the state |+z) at t = 0 when the 
time-dependent magnetic field in the x direction is 
tuned to be resonant frequency. 



Figure 4.6 A sketch of the magnetic-resonance tran¬ 
sition probability as a function of the frequency io of 
the time-dependent magnetic field. 


The maximum probability of transition is plotted as a function of o> in Fig. 4.6. 
Monitoring the losses and gains in energy to the oscillating field as a function of a> 
gives us a nice handle on whether the frequency of this field is indeed the resonant 
frequency of the spin system. Notice in (4.45) that making B\ smaller makes uq 
smaller and the curve in Fig. 4.6 narrower, permitting a more accurate determination 
of COq. 

In practice, the physical spin system consists of a large number of particles, 
either electrons or nuclei, that are in thermal equilibrium at some temperature T. 
The relative number of particles in the two energy states is given by the Boltzmann 
distribution, so slightly more of the particles are in the lower energy state. There 
will be a net absorption of energy proportional to the difference in populations of 
the two levels, since the magnetic field induces transitions in both directions. Of 
course, if we just sit at the resonant frequency, the populations will equalize quickly 
and there will be no more absorption. Thus, in practice, it is necessary' to move the 
system away from resonance, often by varying slightly the field B 0 , thus permitting 
thermal equilibrium to be reestablished. In the case of nuclear magnetic resonance, 
the nuclear magnetic moments are located at the center of the atoms, surrounded by 


Page 143 (metric system) 


128 | 4. Time Evolution 


electrons, and are relatively isolated thermally from their surroundings. Therefore, it 
can be difficult to get the nuclear spins to “relax" back to thermal equilibrium, even 
when the resonance condition no longer persists. In this case thermal contact can be 
increased by doping the sample with paramagnetic ions. 

4.5 The Ammonia Molecule and the Ammonia Maser 


As our last example in this chapter of a two-state system, we consider the ammonia 
molecule. 11 At first glance, the ammonia molecule does not seem a promising 
hunting ground for a two-state system. After all, NH 3 is a complicated system of 
four nuclei and ten electrons interacting with each other to form bonds between 
the atoms, making the stable state of the molecule a pyramid with three hydrogen 
atoms forming the base and a nitrogen atom at the apex (see Fig. 4.7). Here we won’t 
worry about all of this internal dynamics, nor will we concern ourselves with how the 
molecule as a w hole is rotating or translating. Rather, we wi11 take the molecule to be 
in a fixed state as regards all of these degrees of freedom and focus on the location of 
the nitrogen atom; namely, is the nitrogen atom above or below the plane formed by 
three hydrogen atoms? The existence of a reasonably well-defined location for the 
nitrogen atom indicates that there is a potential well in which the nitrogen atom finds 
it energetically advantageous to reside. However, the geometry of the molecule tells 
us that if there is a potential well above the plane, there must be a similar well below 
the plane. Which state does the nitrogen atom choose? Nature likes to find the lowest 
energy state, so we are led to solve the energy eigenvalue problem to determine the 
allowed states and energies of the system. 

We introduce two kets: 

11) = jA 7 above the plane) and |2) = \N below the plane) (4.46) 

and construct the matrix representation of the Hamiltonian using these two states, 
depicted in Fig. 4.7, as basis states. The symmetry of the two physical configurations 
suggests that the expectation value of the Hamiltonian in these states, an energy that 
we denote by should be the same for the two states. Thus 

(\\H\l) = (2\H\2) = E 0 (4.47) 

where H is the Hamiltonian of the system. What about the off-diagonal matrix 
elements? If we look back to our discussion of time evolution of the spin system 
in magnetic resonance, we see that when we set the off-diagonal matrix elements 
of the Hamiltonian in (4.38) equal to zero, the spin-up and spin-down states were 
stationary states; if the system were in one of these states initially, it remained in 


11 Our discussion of the ammonia molecule as a two-state system is inspired by the treatment 
in vol. 3 of The Feynman Lectures on Physics. 
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(b) 


Figure 4.7 The two states of the am¬ 
monia molecule with (a) the nitrogen 
atom above the plane in state 11) and 
(b) the nitrogen atom below the plane 
in state |2). 


that state forever, as (4.39) shows. For the ammonia molecule, the vanishing of the 
off-diagonal matrix elements, such as (2\H\\), would mean that a molecule initially, 
for example, in the state 11}, with the N atom above the plane, would remain in that 
state. Now, if the potential barrier between the two wells were infinitely high, there 
would be no chance that a nitrogen atom above the plane in state j 1) would be found 
below' the plane in state |2). However, although the energy barrier formed by the 
three hydrogen atoms is large, it is not infinite, and there is a small amplitude for 
a nitrogen atom to tunnel between the two states. This means that the off-diagonal 
matrix element (2\H\l) is nonzero. We will take its value to be —A. Thus in the 
ID-2) basis 


/(1|//|1) <l|tf|2}\ = / £ 0 — A \ 

\(2|H|1) (2|H|2) / V-A E 0 ) 


(4.48) 


where A is a positive constant. We will see that this sign for A is required to get the 
correct disposition of the energy levels. Note that if, as w r e have presumed, the off- 
diagonal matrix elements are real, Hermitieity of H , as well as the symmetry' of the 
situation, requires that they be equal. In principle, if we were really adept at carrying 
out quantum mechanics calculations for molecules, we would be able to calculate 
the value of A from first principles. We can think we understand all the physics of 
the electromagnetic interactions responsible for holding the molecule together, but 
NH , is composed of a large number of particles and no one is able to work out all 
the details. We can think of (4.48) as a phenomenological Hamiltonian where the 
value for a constant such as A must be determined experimentally. 

We are now ready to determine the energy eigenstates and eigenvalues of //. The 
energy eigenvalue equation 


HW) = E\f) 

in the )1)-|2) basis is given by 

/ E 0 -AWdhm =E 
V A Eg / \ (2|i/r) / V (2|f) / 


(4.49) 


(4.50) 
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2 A 


E 0 + A 


Eo~A Figure 4.8 The two energy levels of the ammonia molecule. 


The eigenvalues are determined by requiring 




(4.51) 


which yields E = E 0 ± A. We will denote the energy eigenstate with energy 
£/ = E 0 - A by \I). Substituting the eigenvalue into (4.50) shows that (1|/) = (2|7), 
so that we may write 12 





(4.52) 


Energy eigenstate | II) with energy E n = E 0 + A satisfies (11 //) = — (2| //) and thus 
may be written as 


|//> = -Ll>--4l2> (4-53) 

V2 s/2 

The existence of tunneling between the states |1) and |2) has split the energy 
states of the molecule into two states with different energies, one with energy 
E 0 - A and the other with energy £ 0 + A, as shown in Fig. 4.8. The wavelength 
of the electromagnetic radiation emitted when the molecule makes a transition 
between these two energy states is observed to be 12 cm, corresponding to an energy 
separation E n — E l = hv = hc/X of 10~ 4 eV. This small energy separation is to 
be compared with a typical spacing of atomic energy levels that is on the order of 
electron volts, requiring optical or uv photons to excite the atom. Molecules also have 
vibrational and rotational energy levels, but these modes are excited by photons in 
the infrared or far infrared, respectively. Exciting an ammonia molecule from state 
I /) to state [//) requires electromagnetic radiation of an even longer wavelength, 
in the microwave part of the spectrum. The smallness of this energy difference 
£// — £/ = 2A is a reflection of the smallness of the amplitude for tunneling from 
state 11) to |2). 

Notice that neither in energy eigenstate |7) nor | II) is the nitrogen atom located 
above or below the plane formed by the three hydrogen atoms. Under the transforma¬ 
tion 11) |2) that flips the position of the nitrogen atom, the state |/) is symmetric, 

that is, |/} -> |/), while the state \II) is antisymmetric, thatis, |//} -|//). We can, 


12 In the normalization of the state, we have neglected the nonzero amplitude (2| 1) because of 
its small magnitude. 
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however, localize the nitrogen atom above the plane, for example, by superposing 
the energy eigenstates: 


| 1 > = 




(4.54) 


If |yr(0)) = 11), then 


W(t))=e~~ iH, l h 

e -i(E 0 ~A)t/h 


T! |,)+ v! 1 "’ 


= e 


V2 


-i(E 0 - A)t/h 


I/> + 


-i(E 0 +A)t/k 


v/2 


|//> 


s/2 


I/> + 


-2 iAt/h 

7T 


■i in 


(4.55) 


where in the last step we have pulled an overall phase factor out in front of the 
ket. Since the initial state of the molecule is a superposition of energy states with 
different energies, the molecule is not in a stationary state. We see that the relative 
phase between the two energy eigenstates changes with time, and thus the state of the 
molecule is really varying in time. The motion is periodic with a period T determined 
from 2 AT/h = 2tt. What is the nature of the motion? When t — 772, the relative 
phase is re and 


= (overall phase) |2) (4.56) 

The nitrogen atom is located below the plane. Thus the nitrogen atom oscillates back 
and forth above and below the plane with a frequency v = \/T = Ajrxh = 2A/h. 
This frequency, which equals 24 GHz, is the same as the frequency of the electro¬ 
magnetic radiation emitted when the molecule makes a transition between states \1I) 
and |7). 

THE MOLECULE IN A STATIC EXTERNAL ELECTRIC FIELD 


\x//(T/2)) = (overall phase) 


72 ' ' s/2 


i U) 


Since the valence electrons in the ammonia molecule tend to reside somewhat closer 
to the nitrogen atom, the nitrogen atom is somewhat negative and the hydrogen atoms 
are somewhat positive. Thus the molecule has an electric dipole moment ft ,, directed 
away from the nitrogen atom toward the plane formed by the hydrogen atoms. Just 
as the magnetic dipole moment associated with its spin angular momentum allowed 
us to interact with a spin-) particle in Stem-Gerlach or spin-precession experiments 
by inserting it in a magnetic field, we can interact with the ammonia molecule by 
placing it in an external electric field E, as indicated in Fig. 4.9. There is an energy 
of interaction with the electric field of the form ~fi e ■ E that will differ depending 
on whether the nitrogen atom is above the plane in state |1) or below the plane in 
state |2). The presence of this electric field modifies the matrix representation of the 
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Figure 4.9 The electric dipole mo¬ 
ment n e of the ammonia molecule in 
(a) state 11) and (b) state |2). In the pres¬ 
ence of an external electric field E, the 
two states acquire different energies, 
as indicated in (4.57). 


Hamiltonian in the | J)-|2) basis: 1 -’ 

^ /<U H\D <l|ff|2)\ = /£ 0 + ^|E| -A \ 

\{2\H\l) (2\H\2) ) V —A E 0 -n e \E\) 

where we assume the external field is sufficiently weak that it does not affect the 
amplitude for the nitrogen atom to tunnel through the barrier. The eigenvalues are 
determined by the requirement that 


E 0 + /.i e \E\ —E —A 

-A E 0 — n e \E\—E 


(4.58a) 


or 


E = E 0 ± /at,|E|) 2 +A 2 


(4.58b) 


See Fig. 4.10. Most external electric fields satisfy jx e \E\ <<C A, so we can expand the 
square root in a Taylor series or a binomial expansion to obtain 


E = E 0 ± A ± 


1 (M.[E|) 2 

2 A 


(4.59) 


As in the Stern-Gerlach experiments where we used an inhomogeneous magnetic 
field to make measurements of the intrinsic spin and select spin-up and spin-down 
states, here we use an inhomogeneous electric field to separate NH 3 molecules into 
those in states )/) and | //). If we call the direction in which the electric field increases 
the z direction, then the force in that direction is given by 


a r c/AED 2 

dz ~ 2A 


(4.60) 


13 It is customary to use fi e for the electric dipole moment to avoid confusion with the symbol 
for momentum. We also use |E| for the magnitude of the electric field to avoid confusion with the 
symbol for energy. 
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E 



Figure 4.10 The energy levels of the 
ainmonia molecule in an external electric 
field. 



I/) 


l//> 


Figure 4.11 A beam of ammonia 
molecules passing through a region 
in which there is a strong electric 
field gradient separates into two 
beams, one with the molecules in 
state 17} and the other with the 
molecules in the state \II). 


Notice that the minus sign in (4.59) corresponds to the state with energy £ 0 — A in the 
absence of the external electric field. Hence a molecule in state 1 1) will be deflected 
in the positive z direction, while a molecule in the state | //) will be deflected in the 
negative z direction, as shown in Fig. 4.11. Because of the small value of A in the 
denominator in (4.60), it is relatively easy to separate a beam of ammonia molecules 
in, for example, a gas jet by sending them through a region in which there is a large 
gradient in the electric field. 

THE MOLECULE IN A TIME-DEPENDENT ELECTRIC FIELD 

We are now ready to induce transitions between states |7) and | II) by applying a 
time-dependent electric field of the form E = E 0 cos cot. There will be a resonant 
absoiption or emission of electromagnetic energy, provided that hco is equal to the 
energy difference 2A between the two states. This sounds similar to the magnetic 
resonance effects that we treated in the previous section, and in fact the mathematics 
describing the two problems is essentially identical. To see this, consider the Hamil¬ 
tonian in the |1)-|2) basis as given in (4.57) with a time-dependent electric field. If 
we transform to the | /)-1//) basis, we obtain (see Problem 4,10) 

</|H|//>\ = / E 0 — A mJE 0 | cos col \ 

* V {U\H\f) (fl\H\II) ) \ tie |E 0 I COS cot Eq + A ) ' 

Comparing this matrix with that for the Hamiltonian in (4.38) of a spin-4 particle in 
an oscillating transverse magnetic field, we see that it is possible to draw a one-to-one 
correspondence between each term in the two matrices: £ + = fko 0 /2 -* E 0 + A, 
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SL = — fia> (} /2 —* Eq — Ai and hco\/2 —> /iJEol- Thus one can follow the steps 
leading to the probability of making a transition between the spin-up and spin- 
down states in (4.44) and apply them to this new problem to obtain the probability 
of making a transition between stales (/) and (//). Therefore, at resonance the 
probability of finding the ammonia molecule in state |7) for a molecule initially 
in the state 1 11} at time t = 0 is 

|(/|iA(r)>| 2 = sin 2 ( 4 . 62 ) 

In 

We can combine the results of this section and the preceding one to provide a 
description of a simple ammonia maser (Microwave Amplification by Stimulated 
Emission of Radiation). First we use an inhomogeneous electric field to select a 
beam of ammonia molecules that are in the upper energy state [//); then we send 
this beam into a microwave cavity whose resonant frequency is tuned to 24 GHz, 
the resonant frequency of the ammonia molecule. If the molecules spend a time T 
in the cavity such that /r e |E 0 | T/2h = n/2, then according to (4.62) they will all 
make transitions from state \11) to state |/). The molecular energy released in this 
transition is fed into the cavity, where it can be used as microwave radiation. 14 

4.6 The Energy-Time Uncertainty Relation 


As our last topic on time evolution, let’s consider the energy-time uncertainty relation 


AEAt > - 
~ 2 


(4.63) 


The uncertainty relation is somewhat of a misnomer; unlike our previous uncertainty 
relations such as (3.74), only AE in (4.63) is a legitimate uncertainty. It reflects the 
spread in energy characterizing a particular state. To see the meaning of A/, consider 
an example. Let's return to the ammonia molecule that is initially in state 11), with the 
nitrogen atom above the plane. As (4.55) shows, this state is not an energy eigenstate 
but a superposition of two energy eigenstates with different energies. The uncertainty 
in the energy of a molecule in this state is given by 


AE 


= (<£ 2 ) “ ( E } 2 ) 


1/2 


= 15 (^o + A) 2 + | (E 0 — A) J 
= A 


§(E 0 + A) + j(E 0 -A) 


" 1-2 


1/2 


(4.64) 


14 The key element missing from our discussion of the maser is the coherent nature of the 
radiation that it produces. So far we have treated the electromagnetic field as a classical field and 
have not taken into account its quantum properties, that is, that it is really composed of photons. 
We will examine this issue in more detail in Chapter 14. 
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We can express the time evolution of the state (4.55) in terms of the uncertainty 
AE as 


| if(t)) = (overall phase) 



e -2i&Et/h 



(4.65) 


How long do we have to wait before the state of the molecule changes? The answer 
lo this question is the quantity we call At. To be sure the state (4.65) has changed* 
we need to be sure the relative phase between the energy eigenstates J7) and \I1) 
has changed significantly from its value of zero at t — 0 to something of order unity. 
This requires that the time interval At satisfy 2AEAt/hk, 1. which is in accord 
with (4.63). 15 In (4.55) we saw that the time required for the nitrogen atom to appear 
below the plane in state |2) is determined by the requirement that the relative phase 
change by n. Thus the time interval At determined by (4.63) is roughly one-third of 
the time required for the nitrogen atom to oscillate from above to below the plane. 

Notice that if, instead of being in a superposition of energy eigenstates with 
different energies, the state of the molecule had been an energy eigenstate, there 
would be a definite value for the energy of the molecule, and hence AE = 0. But in 
this case, the ket would pick up only an overall phase as time evolved, and the time 
interval At required for the state to change would be infinite. An energy eigenstate 
is really a stationary state. 

Our discussion and example should make clear that At is not an uncertainty at all. 
Time in nonrelativistic quantum mechanics is just a parameter and not a dynamical 
variable like energy, angular momentum, position, or momentum with which there 
may be an uncertainty depending on the state of the system. When we discuss the 
state of the system at time r, there is no inherent limit on how accurately we can 
specify this time. 

In the example we chose a particular initial state | ifr) and then examined the 
length of the evolutionary time At for that state to change. Now that we understand 
the meaning of the uncertainty relation (4.63), we can turn this around slightly. An 
atom (or an ammonia molecule) in an excited-energy state will not remain in this 
state indefinitely, even if undisturbed by any outside influence. It will decay to lower 
energy states with some lifetime r. In Chapter 14 we will see how to calculate the 
lifetime for excited states of the hydrogen atom using the Hamiltonian arising from 
the interactions of charged particles with the electromagnetic field. Thus an excited 
state is not a stationary state, and the lifetime x sets a natural evolutionary time for 
that state. Therefore, from (4.63) there must be an uncertainty in the energy of the 
excited state given by AE ~ h/x. Photons emitted in this transition will have not 


15 We have taken the lower limit in this example as an approximate equality since we have 
somewhat arbitrarily chosen to say that the system has changed when the phase in (4.65) reaches 
one. A more formal derivation of (4.63) and corresponding specification of A t are given in 
Example 4.3. 
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a definite energy but rather a spread in energies. This is the origin of the natural 
linewidth (sec Problem 4.16). 


EXAMPLE 4.3 Consider any observable .4 associated with the state of the 
system in quantum mechanics. Show that there is an uncertainty relation of 
the form 


A E 


/ A A 


\d{A)/dt\ 


h 

> — 
~ 2 


provided the operator A does not depend explicitly on time. The quantity 
AA/\d{A)/dt\ is a time that we may call At. What is the physical signifi¬ 
cance of At? 

SOLUTION Recall that [A, B] = iC implies that AAA# > j(C)|/2. Start 
wfith the commutator [A, H |; then 

1 , 


But since 


then 


or 


If we define 


AAAE > -\{f\ [A, H]\x/f)\ 


d ^ = ‘-mH,Aw 

dt h 


AAAE> - 

"" 2 


d(A) 


dt 


A E 




A A 


\\d(A)/dt\J 


> 


At 


A A 


\d(A)/dt\ 


then 


AAAt > - 
^ 2 

The time At is the time necessary for the expectation value to change by 
an amount on the order of the uncertainty. Thus it is the time you need to wait 
to be sure that the results of measuring A have really changed. For example, 
for position, if Ax = 1 cm andd(x}/dt — lmm/s, then Ax/\d(x)/dt\ = 10s, 
which is the time necessary' for (x) to shift by an amount Ax. 
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4.7 Summary 

Time development is where much of Ihe action occurs in quantum mechanics. To 
move states forward in time, we introduce a time-evolution operator U (t) so that 

W(t)) = O(t)lf(0)) (4.66) 

In order for probability to be conserved as time evolves, 

= 0)l^(0)) (4.67) 

and consequently the operator 0 ( t ) must be unitary: 

U\t)U(t) = 1 (4.68) 

The Hamiltonian H, the energy operator, enters as the generator of time translations 
through the infinitesimal time-evolution operator: 

U {dt) = 1 — — Hdt (4.69) 

h 

The unitarity requirement (4.68) then dictates that the Hamiltonian is Hermitian. 
The time-evolution operator obeys the differential equation 

HUU) = ih—U(t) (4.70) 

dt 

leading to the Schrodinger equation: 

H\f(t)) = ih^-m)) (4.71) 

dt 

A particularly useful solution to (4.70) occurs when the Hamiltonian is indepen¬ 
dent of time, in which case the time?development operator is given by 

U(t) = e ~ ifltlh (4.72) 

The action of the time-development operator (4.72) on an energy eigenstate |£) is 
given by 

e-i»t/h\ E ) - e ~ iEtl ^\E) (4.73) 

showing that a single energy eigenstate just picks up an overall phase as time evolves 
and is therefore a stationary state. Time evolution for a state \rjr) that can be expressed 
as a superposition of energy eigenstates as 

W(0)> = |£„)(£„ 1,^(0)) (4.74) 
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is given by 

\f{t))=e- i " t/h Y J \En)(En\fm 

n 

= ^e-^ /6 |£„i(£„|f(0)) (4.75) 

n 

When the superposition (4.74) involves states with different energies, the relative 
phase between the energy eigenstates changes with time. The time At (the evolu¬ 
tionary time) necessary for the system to change with time in this case satisfies 

AEAt > — (4.76) 

“2 

where A E is the usual uncertainty in energy for the state |t/r). 

Expectation values satisfy 

AU(t )) + (<Mf)I^V(0> (4.77) 

at n dt 

which tells us that observables that do not explicitly depend on time will be constants 
of the motion when they commute with the Hamiltonian. 

Although this chapter is devoted to time evolution, the similarity between the 
operators that generate rotations [see (3.10)] and the operator that generates time 
translations [see (4.72)] is striking. Or compare the form for an infinitesimal rotation 
operator R(d<j> n) = 1 — iJ n d<p/h for rotations by angle dtp about the axis specified 
by the unit vector n with the infinitesimal time translation operator (4.69). We can 
actually tie the rotation operator and the time-evolution operator together with a 
common thread—namely, symmetry. A symmetry operation is one that leaves the 
physical system unchanged, or invariant. For example, if the Hamiltonian is invariant 
under rotations about an axis, the generator of rotations about that axis must commute 
with the Hamiltonian. But (4.77) then tells us that the component of the angular 
momentum along this axis is conserved, since its expectation value doesn’t vary in 
time. Also, if the Hamiltonian is invariant under time translations, which simply 
means that H is independent of time, then of course energy is conserved. We will 
have more to say about symmetry, especially in Chapter 9, but this is our first 
indication of the important connection between symmetries of a physical system 
and conservation laws. 

Problems 


4.1. Show that unitarity of the infinitesimal time-evolution operator (4.4) requires 
that the Hamiltonian H be Hermitian. 

4.2. Show that if the Hamiltonian depends on time and [H(t\), H(t 2 )\ = 0, the time- 
development operator is given by 
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U (t ) — exp 




dt'H(t') 


4.3. Use (4.16) to verify that the expectation value of an observable A does not 
change with time if the system is in an energy eigenstate (a stationary state) and A 
does not depend explicitly on time. 


4.4. A beam of spin-j particles with speed v () passes through a series of two 
SGz devices. The first SGz device transmits particles with S z = h/2 and filters 
out particles with S z = -h/2. The second SGz device transmits particles with 
S z = —h/2 and filters out particles with S’, = h/2. Between the two devices is a 
region of length / 0 in which there is a uniform magnetic field Z? 0 pointing in the 
x direction. Determine the smallest value of 7 0 such that exactly 25 percent of the 
particles transmitted by the first SGz device are transmitted by the second device. 
Express your result in terms of co 0 = egB 0 /2mc and i> 0 . 

4.5. A beam of spin -\ particles in the |+z) state enters a uniform magnetic field B 0 
in the x-z plane oriented at an angle 9 with respect to the z axis. At time T later, the 
particles enter an SGy device. What is the probability the particles will be found with 
S y = h/ 2? Check your result by evaluating the special cases 9 = 0 and 9 = n/2. 

4.6. Verify that the expectation values (4.23), (4.28), and (4.30) for a spin-1 particle 
precessing in a uniform magnetic field B {) in the z direction satisfy (4.16). 


4.7. Use the data given in Fig. 4.3 to determine the g factor of the muon. 


4.8. A spin-1 particle, initially in a state with S n = h/2 with n = sin 9 i + cos 0 k, is 
in a constant magnetic field B 0 in the z direction. Determine the state of the particle 
at time t and determine how (S x ), (S y ), and (5,) vary with time. 

4.9. Derive Rabi’s formula (4.45). 

f.j 

4.10. Express the Hamiltonian (4.57) for the ammonia molecule in the 17)-| //) basis 
to obtain (4.61). Assume the electric field E = E 0 cos cot. Compare this Hamiltonian 
with that for a spin-1 particle in a time-dependent magnetic field that appears 
in (4.38) and deduce the form for the probability of finding the molecule in state 
|/) at time t if it is initially placed in the state |/7); that is, what is the analogue of 
Rabi’s formula (4.45) for the ammonia molecule? 


4.11. A spin-1 particle with a magnetic moment fi = (gq/2mc)S is situated in a 
magnetic field B = R 0 k in the z direction. At time t = 0 the particle is in a state with 
S y = h [see (3.115)]. Determine the state of the particle at time t. Calculate how the 
expectation values ( S x ), {S y ), and (.S',) vary in time. 

4.12. A particle with intrinsic spin one is placed in a constant external magnetic field 
B 0 in the .r direction. The initial spin state of the particle is |i/r (0)) = 11, 1), that is. 
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a state with S, = h. Take the spin Hamiltonian to be 

H = (OqS x 

and determine the probability that the particle is in the state |1, -1) at time t. 
Suggestion: If you haven't already done so, you should first work out Problem 3.15 
to determine the eigenstates of S x for a spin-1 particle in terms of the eigenstates 
of S z . 

4.13. Let 

/ £ 0 0 A \ 

0 E x 0 
\A 0 E 0 ) 

be the matrix representation of the Hamiltonian for a three-state system with basis 
states 11), 12), and |3). 

(a) If the state of the system at time t = 0 is |yV(0)) = |2), what is |i j/(t))7 

(b) If the state of the system at time r = 0 is |i/r(0)) = |3), what is \xfr(t))7 

4.14. The matrix representation of the Hamiltonian for a photon propagating along 
the optic axis (taken to be the z axis) of a quartz crystal using the linear polarization 
states |x) and | y) as basis is given by 

0 -iE 0 \ 

iE 0 0 ) 

(a) What are the eigenstates and eigenvalues of the Hamiltonian? 

(b) A photon enters the crystal linearly polarized in the x direction, that is 
\f(0)) = |x). What is the | x//(t)), the state of the photon at time t? Express 
your answer in the |x)-|y) basis. Show that the photon remains linearly 
polarized as it travels through the crystal. Explain what is happening to the 
polarization of the photon as time increases. 

4.15. If the Hamiltonian for a spin-4 particle is given by 

H = a>oS x 

and at time t = 0 | xfr (0)) = | \, \), determine the probability that the particle is in the 
state ||, — |) at time /. Evaluate this probability when t = n /coq and explain your 
result. Suggestion: See Problem 3.23 for the eigenstates of S x . 

4.16. The lifetime of hydrogen in the 2 p state to decay to the I s ground state is 
1.6 x 1CP 9 s [see (14.169)]. Estimate the uncertainty A£ in energy of this excited 
state. What is the corresponding linewidth in angstroms? 


H 


|x)-|y) basis 
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CHAPTER 5 


A System of Two Spin-1/2 Particles 


Let’s turn our attention to systems containing two spin-1 particles. For definiteness, 
we focus initially on the spin-spin interaction of an electron and a proton in the 
ground state of hydrogen, which leads to hyperfine splitting of this energy level. 
We will see that the energy eigenstates are also eigenstates of total spin angular 
momentum with total-spin zero and total-spin one. The spin-0 state serves as the 
foundation for a discussion of the famous Einstein-Podolsky-Rosen paradox and the 
Bell inequalities. The experimental tests of the predictions of quantum mechanics 
on two-particle systems such as the spin-0 state have profound implications for our 
understanding of the nature of reality. 

5.1 The Basis States for a System of Two Spin-^ Particles 


What are the spin basis states for a system of two spin-particles, such as an electron 
and a proton? A “natural” basis set is to label the states by the value of S z for each 
of the particles: .&• 


|+z,+z) |+z, -z) |-z, +z) j —z, —z) (5.1) 

where the first element in these kets indicates the spin state of one of the particles 
(particle 1) and the second element indicates the spin state of the other particle 
(particle 2). In the notation of Chapter 3 

|±z, ±z) = |ij = |, m l = i|, .Vt = j, rti'i = ij) (5.2) 

Another basis set we could choose—albeit one that looks somewhat less appealing— 
is to label one of the particles by its value of S x and the other by its value of S z : 

I+X, +z) |+x, -z) l-x, +z) |—x, —z) (5.3) 

141 
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In fact, since these are both complete basis sets, we must be able to superpose the 
kets in (5.1) to obtain those in (5.3). For example. 


|+x, +z) = 


1 , , 1 , 
|+ z , +z) H——| 
V2 V2 


-z, +z) 


(5.4) 


Another way to transform from the basis set (5.1) to the basis (5.3) is to rotate the 
spin of the first particle, leaving the second fixed. Rotating the spin state of particle 1 
by tt/2 radians counterclockwise about the y axis transforms, for example, the states 
|±z, +z) into the states |±x, +z). We denote the three generators of rotations for 
particle 1, the angular momentum operators, by S }x , S ]y , and S iz , or. in vector form, 
Sj, where 


Si — 5 u .i + S lv j + 5 l2 k 


(5.5) 


Similarly, we can rotate the spin state of particle 2 with the three generators S 2x , 
S 2)n and S 2z , or, in vector form, S 2 - Since we can rotate the spin state of particle 1 
independently of the spin state of particle 2, the generators of rotations for the two 
particles must commute: 


[S h S 2 ] = 0 


(5.6) 


You might be concerned that if the spins interact with each other, rotating the spin 
state of particle 1 must affect the spin state of particle 2. That, however, is a different 
matter from determining the possible basis states that can be used to describe such a 
two-particle system. In the next section we will examine which linear combinations 
of the basis states (5.1) are eigenstates of the Hamiltonian when the spins do interact 
in a specified way. Just as in our analysis of the ammonia molecule where we selected 
a '‘natural” basis set with kets 11) and 12) that did not turn out to be eigenstates of the 
Hamiltonian, here too we cannot know a priori which combinations of the basis states 
will be coupled together as eigenstates of the Hamiltonian. Our choice of basis states 
does not make any presumption about how, or even whether, the particles interact. 

It is worth noting that there is a useful way to express the basis states of two spin-1 
particles in terms of single-particle spin states. As a specific example, we denote a 
state in which particle 1 has S l2 = ft/2 and particle 2 has S 2z — —ft/2 by the ket 


I+Z.-z) = |+z),<g>|-z) 2 (5.7) 

The kets on the right-hand side of (5.7) are the usual single-panicle spin states, but 
two of them have been combined together in what is referred to as a direct product 
of the individual ket vectors, forming a two-particle state. We have inserted sub¬ 
scripts on the individual kets to emphasize which ket refers to the state of which 
particle. The symbol ® emphasizes the fact that a special type of multiplication is 
taking place when we combine two vectors from different vector spaces. Until now, 
if we put two vectors together, it was always either in the form of an inner product, 
or amplitude, such as (+z|+x), or in the form of an outer product, such as appears 
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in the projection operator |+z) (+z|. Moreover, the two vectors were always vectors 
specifying a state of the same single particle; that is, they were from the same vector 
space. The right-hand side of (5.7) just expresses in a natural way that we can spec¬ 
ify the basis states of a two-particle system with each of the panicles in particular 
single-particle states. Actually, we can simplify our notation and dispense with the 
direct-product symbol ® altogether, since there is really no other way to interpret 
the right-hand side of (5.7) except as the direct product of the two vector spaces. 
Thus the four basis states (5.1) of the two-particle system can also be expressed by 
a direct product of single-particle states as 

|1) = |+Z, +z) == |+z)]|-l-z )2 |2) = l+Z, —z) = |+z)i|— z) 2 

13) = I—Z, +z) = |—z) 1 |+z) 2 |4) — |—z, —z) = I —z) 1 1—z> 2 (5.8) 

where we have ordered the states from 11) through |4) for notational convenience 
when we use these states as basis states for matrix representations in the next section. 

5.2 The Hyperfine Splitting of the Ground State of Hydrogen 


We are ready to analyze the spin-spin interaction of the electron and the proton 
in hydrogen. Of course, the electron and proton interact predominantly through 
the Coulomb interaction V(r) = —e 2 /r, which is independent of the spins of the 
particles. In Chapter 10 we will see that the energy eigenvalues of the Hamiltonian 
with this potential energy are given by E„ = —13.6 eV/« 2 , where n is a positive 
integer. In addition, there are relativistic corrections, due to effects such as spin-orbit 
coupling, that lead to a fine structure on these energy levels that does depend on the 
spin state of the electron. We will discuss this fine structure in detail in Chapter 11. 
There is, however, another interaction that involves the intrinsic spins of both the 
electron and the proton. Since the proton has a magnetic moment, the proton is 
a source of a magnetic field. The magnetic moment of the electron interacts with 
this magnetic field, generating an interaction energy proportional to the magnetic 
moments of both particles and thus, from (1.3). proportional to the intrinsic spins of 
both of the particles. Because the mass of the proton is roughly 2000 times larger than 
that of the electron, the magnetic moment of the proton is roughly 2000 times smaller 
than that of the electron and the overall scale of this spin-spin interaction turns out 
to be even smaller than the fine structure—hence the name hyperfine interaction. 

The complete form of the spin-spin Hamiltonian follows directly from Maxwell’s 
equations. It involves, of course, not just the magnetic moments of the particles but 
also the distance separating the particles. Fortunately, if we restrict our analysis to the 
ground state of hydrogen, a state with zero orbital angular momentum, the spin-spin 
Hamiltonian can be expressed in the simple form 

„ O A , 

// = —S r S 2 (5.9) 

ri A 
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where Sj is the angular momentum operator of the electron and S 2 is the angular 
momentum operator of the proton. The factor of h 2 in the denominator guarantees 
that the constant A has the dimensions of energy. We will determine the value for A, 
which turns out to be positive, from experiment. In our analysis of the ammonia 
molecule, where there was a term in the Hamiltonian that we also denoted by A, this 
was essentially the best we could do; here, calculating A is fairly straightforward, 
because the hydrogen atom is essentially a two-body problem with well-understood 
electromagnetic interactions. 1 

We are now ready to determine the energy eigenvalues and corresponding eigen¬ 
states of the Hamiltonian (5.9). In order to construct the 4 x 4 matrix representation 
of the Hamiltonian using the basis states (5.8), it is convenient to use the operator 
identity 


2S] • S 2 = 2S lx S 2x + 2Si y S 2 y + 2S lz S 2z 

= 5 1 + 5 2 _ + S\_S 2+ + 2S lz S 2z ( 5 . 10 ) 

where the first line reflects the definition of the ordinary dot product, albeit involving 
operators, while the second line follows from the definition of the raising and 
lowering operators for the two particles: 


V 

— Six + ' Sly 

Si- 

= S ix 

-iS ly 

(5.11a) 

£ 24 - 

= S 2 .x + 1 Sly 

S 2 - 

— Six 

- iS 2 y 

(5.11b) 


The expression (5.10) is useful since it permits us to evaluate the action of the 
Hamiltonian on each of the basis states. For example, a typical diagonal matrix 
element is 


( 1 |£| 1 > 


A_ 

h 2 


(+z, +z|( 5 ’]_|_^ 2 — ~b 5 ]_S 2+ + 25 i z 5 2 z )|+z, +z) 


— — {+z, +z| 25 | z S 2 ,|+z, +z) — 

Art 


A 

2 


( 5 . 12 ) 


Note that the raising and lowering operators change the basis state and therefore 
cannot contribute to a diagonal matrix element (in this case both the raising operators 
yield zero when they act to the right on the ket). A nonvanishing off-diagonal matrix 
element such as the element in the third row and second column is 


< 3 |ff| 2 > = 


— (—z, +zj(S 1+ S 2 _ + Si_S 2+ + 2S lz S 2z )\+z, —z) 

— (—z, +z| 5 , ]_ 5 2 + |+z, —z) = A 

Art 


(5.13) 


1 See, for example. S. Gasiorowicz, Quantum Physics, 3rd edition, John Wiley, New York, 
2003, Chapter 12. 
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where in the second line we have retained the only operator term that can give a 
nonzero contribution to the matrix element. 

Working out the remaining matrix elements, we find that the matrix representa¬ 
tion of the Hamiltonian in the basis (5.8) is given by 


(A/2 0 0 0 \ 

0 -A/2 A 0 

0 A -A/2 0 

v 0 0 0 A/2 ) 


(5.14) 


The energy eigenvalue equation H\ijr) = E\tfr) in this basis is 


(A/2 0 0 0 \ 


(m)\ 



O 

1 

O 


(2m 

= E 

(2m 

ot 

< 

1 

■< 

O 


(M) 


am 

CN 

O 

o 

o 


\ (4|t/r) / 


\m)J 


The energy eigenvalues are determined by the requirement that the determinant of 
the coefficients vanishes: 


A/2 — £ 0 

0 —A/2 — E 

0 A 

0 0 


0 0 

A 0 

-A/2 - E 0 

0 A/2 — E 


(5.16) 


which yields (A/2 - E) 2 [(E + A/2) 2 - A 2 ] = 0. Thus three of the eigenvalues are 
E = A/2 and one of them is E = -3A/2, as indicated in Fig. 5.1. If we substitute 
these energies into (5.15), we obtain the three column vectors 

i- 


/ 1 \ 

0 

0 

w 


l 

7 ! 


/0\ 

1 

1 

Vo/ 


and 


/ 0 \ 

0 

0 

W 


(5.17) 


which represent the normalized eigenstates 


+z, Tz) 


1 

7! 


l+z. 


-z) + 


1 


-z, +z> 


|-z, -z) 


(5.18a) 

(5.18b) 

(5.18c) 
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Ei + A/2 
El 

Figure 5.1 The hyperfine splitting of the ground-state 
energy level of hydrogen. The energy E x is the energy 
Ei - 3A/2 of the ground state excluding the hyperfine interaction. 


with E = ,4/2 and the column vector 


s/2 


( ° \ 

1 

-1 


V 0 ) 


which represents the eigenstate 


s/2 


|+z. 


s/l 


|-z, +z) 


(5.19) 


(5.20) 


with E = -3/1/2. Thus there is a single two-particle spin state for the ground state, 
while the excited state is three-fold degenerate. 

A photon emitted or absorbed in making a transition between these two energy 
levels must have a frequency v determined by hv = 2A. For hydrogen this frequency 
is approximately 1420 MHz, corresponding to wavelength k of about 21 cm, which 
is in the microwave part of the spectrum. The frequency has actually been mea¬ 
sured to one part in 10 13 —v = 1,420,405,751.768 ± 0.001 Hz—making it the most 
accurately known physical quantity. 2 The technique responsible for this unusual 
achievement is our old friend the maser. In the hydrogen maser, a beam of hydro¬ 
gen atoms in the upper energy state is selected by using a Stem-Gerlach device. 
The beam then enters a microwave cavity tuned to the resonant frequency. Because 
of the very long lifetime of a hydrogen atom in the upper energy state, 3 * * the natu¬ 
ral litiewidth is especially narrow and consequently the spectral purity is especially 
high, permitting such an accurate determination of the resonant frequency. Inciden¬ 
tally, the theoretical value for the hyperfine splitting has been calculated to “only” 
1 part in 10 6 , leaving considerable room for theoretical improvement. 

Finally, it should be noted that the 21-cm line of hydrogen provides us with 
an extremely useful tool for investigating the density distribution and velocities of 
atomic hydrogen in interstellar space. The intensity of the radiation received by a 


2 This measurement was first carried out by S. B. Crampton. D. Kieppner, and N. F. Ramsey, 
Phys. Rev. Lett. 11, 338 (1963), who merely obtained v = 1,420,405,751.800 ± 0.028 Hz. 

3 This is a magnetic dipole transition, not the more common electric dipole transition that 

generally leads to a substantially shorter lifetime, as in NH ? . We will discuss these types of 

transitions in Chapter 14. 
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radio-frequency antenna tuned to 1420 MHz is a measure of the concentration of 
the gas, while the Doppler frequency shifts of the radiation provide a measure of the 
velocity of the gas. 

5.3 The Addition of Angular Momenta for Two Spin-j Particles 


In solving the energy eigenvalue problem, we have determined the eigenstates of the 
operator 2Sj • S 2 : 


2Sj■S 2 


and 


|+z, +z) 

-4=1+2. -2) + -4=1-2, +Z> 

V2 V2 

I z, -z) 


2 


|+z, +z) 

-4=1+2, z) + -4=1-2, +z) 

v2 s/2 


-z, -z) 


(5.21) 


2S 1 .S 2 (-L| +Z ,- Z) -2 | 


-Z, +Z> = 


3tf ( 1 , . 1 , 


(5.22) 


These eigenstates have a much deeper significance than has been apparent from 
just our discussion of the hyperfine splitting in hydrogen. To see this significance, 
first consider the infinitesimal rotation operator for a system of two spin-4 particles. 
In order to rotate a two-particle spin ket by angle dft about the axis specified by the 
unit vector n, we must rotate the spin state of each of the particles by the angle d6 
about this axis. See Fig. 5.2. Thus the infinitesimal rotation operator for the system 
is given by 

R(d0 n) = 1 - -S-n d0 

h 


= l 


h 


Sj*- n dO 


1- r S 2 n dO 


n 


(5.23) 


where in the first line we have introduced a new vector operator S whose three 
components are the generators of rotations for the two-particle spin system, and in 
the second line we have expressed this system-rotation operator in the direct-product 
space in terms of the rotation operators for the individual particles, to first order in 
do. Since the components of S generate rotations, these components must satisfy the 
usual commutation relations (3.14) of angular momentum. We call S the total spin 
angular momentum. As (5.23) shows, the total spin angular momentum operator 
S is related to the individual spin angular momentum operators in just the way we 
would expect: 

S = S, <g> 1 + 1 <g> S 2 (5.24) 
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Figure 5.2 A schematic diagram showing the rotation of 
the spins of two spin-up-along-z particles by angle dO about 
the y axis. Note that the generator of rotations S rotates the 
spins but not the positions of the two particles. 


or, more simply, S — S) + S 2 , where, if we are operating in the direct-product 
space, the operator S, is understood to include the identity operator in the vector 
space of particle 2, and so on. The total spin angular momentum is just the sum 
of the individual spin angular momenta. You can also verify directly, using (5.6) 
and the commutation relations of the individual spin operators, that the components 
of the total spin operator (5.24) satisfy the usual commutation relations of angular 
momentum, such as 


Sly + S%] = ih(S] z + S 2;: ) 

(5.25) 

Solving the angular momentum problem for total spin means finding the two- 
particle eigenstates of 

S 2 = (S, + S 2 ) 2 = S 2 + S 2 + 2Sj • S 2 

(5.26a) 

and 


s z = s u + s 2z 

(5.26b) 

From our general analysis of angular momentum in Chapter 3, which was based 
solely on the fact that the angular momentum operators obey (he commutation 
relations (3.14), we know that we can express these eigenstates of total spin in the 
form 

S |s, m) = j?(j + l)/t 2 |s, m) 

(5.27a) 

S z x, m) = mh\s, m) 

(5.27b) 

Since 


S 2 |±z, ±z) = i (| + l) /i 2 |±z, ±z) 

(5.28a) 

and 


S;|±z, dbz) = 5 + 0 h 2 |±z, ±z) 

(5.28b) 
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we see that the eigenstates of 2Sj • S 2 are eigenstates of S 2 as well. Using the 
eigenvalue \h l for 2Sj • S 2 from (5.21), we find that each of the three states in (5.21) 
is an eigenstate of S 2 = S 2 + S 7 + 2S 1 • S 2 with [|(i + 1) + 4(4 + 1) + |]ft 2 = 2/i 2 
as the eigenvalue, or, in the notation of (5.27a), they have 5 = 1 and are spin-1 states. 
The eigenvalue of the single state in (5.22) is [4(j + 1) + 5(5 + 1) — 4] h 2 = 0, 
and thus it is an 5 = 0 state. In fact, each of the states in (5.21) and (5.22) is also an 
eigenstate of the z component of total spin. For example, 

S z |+z, Tz) = (S] z + S 2z )|+z, Tz) = ^ ^ |+z, I z) = h|+z, +z) (5.29) 


Thus, using the ( 5 , m) notation for total spin, we find 
|L» 1) = |+z, Tz) 

|1, 0) = ~\+z> -z) + -U-z, +z> 
V2 V2 


and 


IF -1) = I—z, -z> 

10, 0) = ~\+z, -Z) - 7 = I z, +z) 

V2 V2 


(5.30a) 

(5.30b) 

(5.30c) 

(5.31) 


Thus we have learned how to “add” the spins of two spin-1 particles to make states 
of total spin. 

It is worth noting here that there is another way to see which linear combinations 
of the basis states (5.8) are eigenstates of total spin. Since 


[S 2 , S,} = 0 


(5.32) 


these two operators have eigenstates in common. Because the basis states |+z, +z) 
and |—z, —z) are eigenstates of S z with eigenvalues h and —h, respectively-—and 
they are the only basis states with these eigenvalues for the z component of the 
total spin—they must be eigenstate|, of S 2 as well. As we have seen, they are 
spin-1 states. On the other hand, there are two basis states, |+z, —z) and |—z, +z), 
that are eigenstates of S~ with eigenvalue 0: 


5 z |+z, —z) — (S lz + S 2 ,)|+z, —z) — 



|+z, -z) =0 


(5.33a) 


+ Z ) — (Sj ;2 + S 2z ) |—Z, +Z) — 



-z, +z) = 0 (5.33b) 


For spin 1, the allowed m values are 1,0, and —1, so a linear combination of the 
states |+z, —z) and |—z, +z) must be the missing m = 0 state. We can obtain this 
state by applying the lowering operator 


+ .| 2 _ 


(5.34) 
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to the state 11, 1) or applying the raising operator 

S + = 5 1+ + S 2+ (5.35) 


to the state 11, —1). For example, 

S_| 1, 1) = (S,_ + S 2 _)|+z, +z) 

= h( |-z, +z) + |+z, -z)) 

= V2fi|l,0) (5.36) 

where the last step follows from (3.60) for the total-spin state. Dividing through 
by the factor of V2 h leads to correctly normalized expression for the state |1, 0). 
The other total-spin state, the |0, 0) state, can be determined by finding the linear 
combination of the states |+z, —z) and | —z, +z) that is orthogonal to the 11, 0) state. 
Satisfying the condition (0, 0| 1, 0) = 0 leads to (5.31), up to an arbitrary overall 
phase. 4 

In terms of total-spin states, the spin-spin interaction in hydrogen splits the 
ground-state energy level into two levels, with the triplet of spin-1 states forming 
the upper energy level and the singlet spin-0 state forming the true ground state. The 
magnitude of the hyperfine splitting in hydrogen is roughly 5.9 x 10~ 6 eV, which is 
to be compared with the typical spacing between energy levels that is on the order 
of electron volts. The magnitude of the splitting is indeed quite small in this case. 

An interesting example of spin-spin interaction where the magnitude of the 
interaction is much larger than that between the electron and the proton in hydrogen 
occurs in the strong nuclear interaction that binds quarks and antiquarks (both 
spin-) particles) together to form mesons. In particular, a u quark and a d antiquark 
bind together to form a 7r + , a spin-0 particle. The rest-mass energy of the n + is 
roughly 140 MeV. Changing the total-spin state of the u-d system from a singlet 
spin-0 state to the triplet of spin-1 states generates a different particle, the spin-1 
p + , which has a rest-mass energy of roughly 770 MeV. Thus the energy cost of 
reorienting the spins of the constituent quarks is a hefty 630 MeV. 


DISCUSSION OF THE SPIN-0 AND SPIN-1 STATES 

Before concluding this section, some discussion of these important spin-0 and 
spin-1 states is in order. Two of our initial basis states in (5.8), namely the |+z, +z) 
and | —z, —z) states, cannot be spin-0 states because the projection of the total spin 
on the z axis is nonzero for each of these states; the individual spins are either both 


4 The results we have obtained are, in fact, a special case of a more general result: adding 
angular momentum j) to angular momentum j 2 generates states of total angular momentum j, 
where j takes on values ranging from j l + j 2 to |y) — ) 2 1 in integral steps. See Appendix B fora 
way to generate these states using angular momentum raising and lowering operators. 
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up or both down, respectively. The i/t values for total S : for these states are, of 
course, consistent with their being spin-1 states. We can say that the other two basis 
states, |+z, -z) and |-z, +z), consist of states in which the spins of the individual 
particles are opposite to each other, one spin up and the other spin down in each 
case. Nonetheless, having the spins oppositely directed can produce states with total 
spin of one as well as zero, depending on the linear combination (5.30b) or (5.31) 
one chooses. Clearly, the relative phase between the state |+z, -z) and the state 
|—z, +z) in the superposition is of crucial importance. 

To see the effect of this phase even more clearly, let’s express the states |0, 0) and 
11, 0) in terms of the S x basis states for each of the individual particles. The states 
|(), 0) and 11. 0) are, of course, still eigenstates of S z = 5 lz + S 2z - From (3.102) we 
know that for a single spin-) particle 


i±z)=-U X > ± 4=i-*> 

V2 a/2 


(5.37) 


Using this result, we can express the two-particle total-spin states |0, 0) and 11, 0) as 

11 , 0 ) 


—~|+z, -z) + -U-Z, +z) 
V2 sf2 


— t+z),|—Z >2 -I--=|-z),|+z) 2 

a/2 a/2 


a/2 


+ 


L7! 


(l+x)i + | x) j) 


La/2 


(l+ x )2 ~ I x )2) 


2 L a/2 


(l+x)i — l — x)|) 


LV2 


(|+x) 2 + I —x) 2 ) 


= 4l+x)ll+X> 2 - 1 X)! | X> 2 

a/2 a/2 

1 , , 1 . 

= —pl+X, +x) - -fe|-x, -X) 

a/2 a/2 


(5.38) 


and, in a similar fashion. 


io, o) - 4=i+ z - - z ) - 4=i- z > + z ) 

a/2 v2 


V2 


|+x, -x) 


1 


V2 


-x, Tx) 


(5.39) 


If we make measurements of both 5), and S 2x on the individual spin- ) particles in 
the state 11, 0), we find them with a 50 percent probability both spin up or both spin 
down along the ,r axis, reflecting the fact that this is really a spin-1 state. On the 
other hand, if we make measurements of both .S', ( and S 2x on the particles in the 
state |0, 0), we always find the spins oppositely aligned, one spin up and the other 


/ 
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spin down, but this time along the x axis instead of the z axis. In fact, if we measure 
the components of the spin of the individual particles along an arbitrary axis for the 
|0, 0) state, this opposite alignment of the individual spins must be maintained (see 
Problem 5.3), as would be expected for a state with total spin equal to zero. 


5.4 The Einstein-Podolsky-Rosen Paradox 


Consider a spin-0 particle at rest that decays into two spin-4 particles. 3 In order to 
conserve linear momentum, the two particles emitted in the decay must move in 
opposite directions. In order to conserve angular momentum, the spin state of the 
two-particle system must be |0, 0), assuming there is zero relative orbital angular 
momentum. Two experimentalists, A (Alice) and B (Bob), set up Stem-Gerlach 
measuring devices along this line of flight, as depicted in Fig. 5.3. Each observer 
is prepared to make measurements of the intrinsic spin of the particles as they pass 
through their respective SG devices. 

What can we say about the results of their measurements? Let's call the particle 
observed by A particle 1 and the one observed by B particle 2. We call the line of flight 
of the two particles the y axis. If A and B both decide to make measurements of S z on 
their respective particles and A obtains a value of Sj z = h/ 2, B must obtain a value 
of S 2z = —h/ 2. From the expression (5.39) for the |0, 0) state, we see that there is a 
50 percent probability to find the system in this |+z, —z) state. Similarly, if A obtains 
,Sj z = —/i/2, B must obtain % = h/2. Again, there is a 50 percent probability to find 
the system in the state | —z, +z). The striking thing about these results is that if A 
measures first, A’s measurement has instantaneously fixed or determined the value 
that B can obtain, even though the two particles may be completely noninteracting 
and A and B may be separated by light-years. 

Al though we have argued that the relative phase between the basis states |+z, —z) 
and |—z, +z) in the |0, 0) state matters, if you still tend to think in classical terms that 
a |0, 0) state is just a 50-50 mix of these two states, this instantaneous determination 
of B's result by A's measurement doesn’t seem so strange. Imagine that you hold in 
your hand two colored balls that are identical in feel but one is green and the other 
is red. You separate the balls without looking and put one in each hand. If you look 
at the ball in your left hand and find it is red, you have immediately determined 
that the ball in your right hand is green, even before you open your right hand. 
You would, of course, presume that the ball in your right hand was green all along, 
whether or not you had opened your left hand to check the color of the ball. This 


5 We are viewing this as a thought experiment. An actual example ofa spinless particle decaying 
into two spin-) particles is the rare decay mode of the >j meson into a /z + —/j pair. However, in 
order to measure the spin with an SG device in our thought experiment, we need to presume that 
the particles emitted in the decay are neutral. 
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Figure 5.3 A schematic of the EPR experiment in which A measures the spin of 
particle l and B measures the spin of particle 2. 


cannot, however, be an adequate explanation of what is going on the spin system. 
The reason is that A and/or B can choose to measure a different component of the 
spin of the particles. Suppose that both A and B decide to measure S x instead ot S z . 
As (5.39) indicates, if A obtains the value S lx = h/ 2, then B must obtain S 2x — -/i/2. 
Similarly, if A obtains S Lv = -h/2, then B must obtain S 2x = li/2. Here again, the 
results of their measurements are completely correlated and measurements by A can 
determine the results of measurements by B. As we have been in Chapter 3, however, 
spin-/ particles cannot have definite values for both S z and S x . The state of particle 2 
as it travels toward B’s SG device, for example, cannot be an eigenstate of both S 2z 
and S 2x . In our example of the colored balls, it would be similar to the balls having 
two other colors such as blue and yellow, as well as red and green. Finding one 
of the balls to be yellow demands that the other is blue, just as finding one of the 
balls to be red demands that the other ball be green. However, a single ball cannot 
simultaneously have two colors andfbe, for example, both red and yellow. So what 
color is the ball in your right hand before you look? 6 

The idea that particles do not necessarily have definite attributes has been implicit 
in our discussion of quantum mechanics from the beginning. A single spin- / particle 
in the state |+x) does not have a definite value for S~. Before a measurement is 
carried out, we can only give the probabilities of obtaining S z = h/2 or S, = —h/2', 

6 Note that if A chooses to measure S : and B chooses to measure S x , the results of their 
measurements will be completely uncorrelated. It A, for example, obtains Sj- = h/2. then since 

|(+z, +x|0, 0}| 2 = |<+z. -x|0, 0)| 2 = | 

B has equal probabilities of obtaining .ST, = ft/2 and S 2x = —h/2. 


/ 
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the particle has amplitudes to be in both the state |+z) and the state |—z). Once 
a measurement of 5. is made, however, this uncertainty in the value of S z for the 
particle disappears; the particle is then in a state with a definite value of S z . The 
new feature that is raised by our discussion of the two-particle system is that a 
measurement carried out on one of the particles can immediately determine the state 
of the other particle, even if the two particles are widely separated at the time of 
the measurement. This is the straightforward result of applying quantum mechanics 
to a two-particle system. A measurement of S l2 unambiguously selects either the 
jTz.—z) or the |— z,+z> state. A measurement on part of the system in the form 
of a measurement on one of the particles in this two-particle system is really a 
measurement on the system as a whole. 

Not everyone has been happy with this state of affairs. In particular, Albert 
Einstein never liked the idea that a single particle could be in a state in which the 
particle did not have a definite attribute, be it spin or position. In his view, this 
meant that physical properties did not have an objective reality independent of their 
being observed. For Einstein there was a more reasonable position. Although the 
results of measurements carried out on a single particle are in complete accord with 
quantum mechanics, these results do not of themselves demand that a particular 
particle does not have a definite attribute before the measurement is made. As we 
discussed in Section 1.4, testing the predictions of quantum mechanics requires 
measurements on a collection of particles, each of which is presumed to be in the 
same state. Thus Einstein could believe that 50 percent of the particles in the state 
|+x} also had S . = h/2 and that 50 percent had S z = —hf 2 but that we are unable 
to discriminate between these two types of particles, as if the attribute that would 
allow us to distinguish the particles was hidden from us—hence a hidden-variable 
theory of quantum mechanics. 

In order to show how unsatisfactory the conventional interpretation of quantum 
mechanics really was, Einstein, Podolsky, and Rosen devised the ingenious thought- 
experiment on a two-particle system of the type that we have been describing in this 
section. 7 It is one thing to have a definite attribute for a particle dependent on having 
made a measurement of that attribute on the particle, but it is even more unusual 
to have that attribute determined by making a measurement on another particle 
altogether. To Einstein this was completely unacceptable: “But on one supposition 
we should, in my opinion, absolutely hold fast. The real factual situation of the 
system S 2 is independent of what is done with the system S 1? which is spatially 


7 A. Einstein, B. Podolsky, andN. Rosen. Phys. Rev. 47. Ill (1935). The particular experiment 
described in their paper involved measurements of two different noncommuting variables, position 
and momentum, instead of two components of the intrinsic spin such as S z and S x . The general¬ 
ization of their argument to spin-j particles was initially made by D. Bohm, Quantum Theory, 
Prentice-Hall, Englewood Cliffs, N.J., 1951, pp. 614-619. 
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Figure 5.4 A schematic of the EPR experiment in which 
B measures the spin of particle 2 with an SGz device and A 
measures the spin of particle 1 with an SGn device, where 
the inhomogeneous magnetic field in the SGn device 
makes an angle 0 in the x-z plane. 


separated from the former.'Because the conventional interpretation ol quantum 
mechanics, which we have used in analyzing measurements by A and B on this two- 
particle system, is so completely at odds with what Einstein termed any “reasonable 
definition of the nature of reality,” which includes this locality principle, the issue 
raised in their 1935 paper is generally referred to as the Einstein-Podolsky-Rosen 
paradox. 

EXAMPLE 5.1 In an EPR experiment the orientation of A’s SG device is at 
angle 0 in the x-z plane, while ihe orientation of B's SG device is along the 
z axis, as indicated in Fig. 5.4. Show that 50 percent of B's measurements 
yield S 2 , = h/2 and 50 percent yield S 2z = -h/2 independent of 9. 

tr 

SOLUTION In the x-z plane, we use the single-particle spin-up and spin- 
down states 

|+n) = cos —■ |+z) + sin ^|-z) |-n) = sin °-\+z) - cos ^|-z) 

The system of two particles is in the total-spin-0 states, 

W) = |0, 0) = -U+z, -z) - -J=|-z, +z) 


8 A. Einstein, in P. A. Schilpp, ed„ Albert Einstein, Philosopher-Scientist. Tudor, New York, 
1949, p. 85. 
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Thus 

(+n, +z|0, 0) = 

(+n, -z|0, 0) = 

(-n, +z|0, 0) = 

(-n, —z|0, 0} - 

The probability of B obtaining S z = h/2 is 

|(+n, +z|0, 0> | 2 + |{-n, +2)0, 0}| 2 — - sin 2 - + - cos 2 - = - 

2 2 2 2 2 

Thus B's measurements alone do not contain any information about the 
orientation 0 of A's SG device. 


--L (+n , +zl _ z , +z) = __L (+ „ l _ z) = __L sin f 

-f (+„. -z|+z, -z) = -f (+11 | +z) = _L CO s f 

1 / ,1 , v 1 . . , 1 6 

—P<-n, +z|-+ +Z> = —7-{-n -z) = - — cos - 
v 2 v 2 v2 2 

1 / , , , 1 , , , 1 . 6 ) 
_,_ n ._ z|+z ._ z) = _ M+z) = _ sin _ 


5.5 A Nonquantum Model and the Bell Inequalities 


Until 1964 it was believed that one could always construct a hidden-variable theory 
that would give all the same results as quantum mechanics. In that year, however, 
John S. Bell pointed out that alternative theories based on Einstein’s locality prin¬ 
ciple actually yield a testable inequality that differs from predictions of quantum 
mechanics. 9 As you might guess from our earlier discussion about measurements 
of S z for a particle in the state |+x), this disagreement cannot be observed in mea¬ 
surements on a single particle. Rather, it is a prediction about correlations that are 
observed in measurements made on a two-particle system such as the two spin-j 
particles in a singlet spin state. 

Let us first see how we can construct a local theory in which particles have their 
own independent attributes that can account for all the results of measuring S z or 
S x on a system of two particles in the 0. 0) state. As the particles travel outward 
toward the SG devices, there is no way to know in advance what the orientation of 
these devices will be. In fact, A and B may alter the orientation of their respective SG 
devices while the particles are in flight. The “local realist” wants each of the particles 
to possess its own definite attributes with no inherent uncertainty. Thus each particle 
must carry with it all the information, or instructions, necessary to tell the SG device 
what to yield if a measurement of S z or S x for that particle is made. For example, 


9 J. S. Bell, Physics 1, 195 (1964). 
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a single particle such as particle 1 may be of the type {+z, +x}, indicating that A 
obtains ft/2 for a measurement of S lr or ft/2 for a measurement of Note that 
we are inventing a new { } notation to provide a nonquantum description of the state 
of the particle. In this model, particle 1 is presumed to have definite values S ]z and 
Sy, which is completely at odds with our earlier analysis of the allowed angular 
momentum states of a particle in quantum mechanics. However, in order to avoid 
obvious disagreements with experiments such as the Stem-Gerlach experiments of 
Chapter 1. we are not suggesting that A can simultaneously measure /fo and Sy 
for this particle. A’s decision to measure S ix , for example, on a particle of the type 
{+z, +x} means that A forgoes the chance to measure Sy on this type of particle. 
The value of S lz of the particle is essentially hidden from us. In fact, making a 
measurement of S h and obtaining ft/2 must alter the state of the particle. After this 
measurement of S lx on a collection of particles of the type {+z, +x), 50 percent of 
the particles would now be of the type {+z, +x} and 50 percent would be of the type 
{—z, +x}. In this way, the local realist can reproduce the results of Experiment 3 in 
Chapter 1 on a single particle. 

Conservation of angular momentum for two particles in a spin-0 state requires 
that particle 2 be of the type {—z, —x} if particle 1 is of the type {+z. +x). Let us 
assume that four distinct groups of the two particles are produced in the decay of a 
collection of spin-0 particles: 


Particle 1 

Particle 2 

{+z, +x) 

{ z, -x} 

(+Z, -X} 

{-Z, +x) 

{- z, +x) 

{+Z, -x} 

{-z, -X} 

{+z, 4-x} 


and that each of these distinct groups of particles is produced in equal numbers. 
If A and B both make measurements of S z or both make measurements of S x on 
their respective particles, the result! are consistent with conservation of angular 
momentum (and the predictions of quantum mechanics) since they always find the 
spin components of their particles pointing in opposite directions. In addition, if A, 
for example, makes measurements of 5 lz and obtains the value ft/2 and B makes 
measurements of S 2x , 50 percent of B’s measurements will yield ft/2 and 50 percent 
will yield —ft/2, since 50 percent of B’s particle must be of the type {—z, —x} and 
50 percent must be of the type {—z. +x}. Thus this simple, nonquantum model in 
which each of the particles in the two-particle system has definite attributes is able to 
reproduce the results of quantum mechanics. Moreover, in this model the results that 
B obtains are completely predetermined by the type of particle entering B’s detector, 
independent of what A chooses to measure. This makes the local realist happy. 

We now want to show that this simple model cannot reproduce all the results of 
quantum mechanics in a somewhat more complicated experiment in which A and B 
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agree to make measurements of the spin along one of three nonorthogonal, coplanar 
directions specified by the vectors a, b, and c. Each of the particles must now belong 
to a delinite type such as {+a, —b, +c}, for which a measurement by A or B on a 
particle of this type would yield h/2 if the SG device is oriented along the direction 
specified by a or c, but would yield -h/2 if the SG device is oriented along the 
direction specified by b. Again, in order to conserve angular momentum, if particle 1 
is of the type {+a, -b, +c}, then particle 2 must be of the type {-a, +b, —c), so that 
if A finds particle 1 to have its spin up or down along some axis, B finds particle 2 to 
have its spin oppositely directed along the same axis. There are now eight different 
groups that the two particles emitted in the decay of a spin-0 particle may reside in: 


Population 

Particle 1 

Particle 2 

A, 

{+a, +b, +c} 

{-a, -b, -c} 

n 2 

(+a, +b, -c) 

{-a, -b, +c) 

n 3 

{+a, -b, +c} 

{-a, +b, —c} 

n 4 

{+a, -b, -c} 

{-a, +b, Tc) 

n 5 

{—a, +b, +c} 

{+a, -b, -c} 


{-a, +b, -c} 

{+a, ~b, +c) 

n 7 

{-a, -b, +c) 

{+a, +b, -c) 

a 8 

{-a, -b, -c} 

{+a. +b, +c) 


First, let’s consider an experiment in which A and B orient their SG devices at 
random along the axes a, b, and c, making measurements of the spin of the particle 
along these axes. 10 Let’s examine the correlations in their data for those cases in 
which their SG devices are oriented along different axes. In particular, let's see what 
fraction of their measurements yield values for the spin of the two particles that have 
opposite signs, such as would be the case, for example, if A finds particle 1 to have 
■ t) ia = h/2, and B finds particle 2 to have S ^ — —h/2. Clearly, all measurements 
made on particles in populations N\ and (Vg will yield opposite signs for the spins of 
the two particles. On the other hand, for population A 2 , when A finds S la = H/2, B’s 
measurement yields the result S 2h = -li/2 (with the opposite sign) if B’s SG device 
is oriented along b, but if instead B’s SG device is oriented along the c axis, B 
obtains S 2c = h/2 (with the same sign). Similarly, if A’s SG device is oriented along 
the b axis, A finds — h/2 while B finds S 2a = —h/2 or S 2c = h/2, depending 
on whether B’s SG device is oriented along a or c, respectively. Finally, still for 
population N 2 , if A's SG device is oriented along the c axis, A obtains S’ lt . = -h/2. 


10 This thought experiment was suggested by N. D. Mermin, Am. J. Phys. 49, 940 (1981). 
See also his discussion in Physics Today , April 1985. Our derivation of the Bell inequality (5.54) 
follows that given by J. J. Sakurai, Modern Quantum Mechanics, Benjamin-Curnmings Menlo 
Park, CA, 1985. 
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while B finds S 2a = —h /2 or S 2b = —h/2. Thus, overall for populations N 2 , | =* | 
of the measurements yield results with opposite signs when the SG devices are 
oriented along different axes. This ratio holds for all the populations N 2 through /V 7 . 
Since measurements on populations and /V 8 always yield results with opposite 
signs, independent of the orientation of the SG devices, at least one-third of the 
measurements [in fact, | (|) + (|) = \ of the measurements if all eight populations 
occur with equal frequency] will find the particles with opposite signs for their spins 
when the two experimentalists orient their SG devices along different axes. 

Although this result seems straightforward enough, we can quickly see that it is 
in complete disagreement with the predictions of quantum mechanics, at least for 
certain orientations of the axes a, b, and c. We express the 10, 0} state as 


[0, 0) = — |+a, -a) 
v2 


1 

-s/2 


-a, +a) 


(5.42) 


The amplitude to find particle 1 w-ith Sj,. = —hj 2 and particle 2 with S 2h = h/2 is 
given by 


1 1 

(-a, +bj0. 0) = -(-a, +b|+a, -a)--<-a, +b|-a, +a) 

V2 v2 

= ~-^{-a, +b|—a, +a) = - -J= ( t f—a|—a),) ( 2 (+b|+a> 2 ) 


1 

’7! 


(+b|+a) 


(5.43) 


where we have expressed the two-particle state in terms of a direct product of single¬ 
particle states to evaluate the amplitude in terms of single-particle amplitudes. We 
have also dropped the subscripts on the last amplitude, which involves only a single 
particle. From our earlier work (see Problem 3.2), we know that 

|+n) = cos-|+z) + e 1 ^ sin-|— z) (5.44) 

2v 2 

Thus (+z|+n) = cos(0/2), where 9 is the angle n makes with the z axis. Therefore, 

<+b[+a)=cos^ (5.45) 

where 9 ab is the angle between the a and the b axes, as shown in Fig. 5.5. The 
quantum mechanical prediction for the probability of finding the particles in the 
state |— a, +b) is 

[ (-a, +b|0, 0) | 2 = ^ cos 2 (5.46) 

Similarly, 

|{+a, —b|0, 0)| 2 = ^ cos 2 (5.47) 
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Figure 5.5 Two axes a and b used for measuring the 
spin. 


Thus the total probability that A and B obtain opposite signs for the spin when they 
make measurements with A’s SG device oriented along a and B’s SG device oriented 
along b is given by 

|{+a, — b|0, ())| 2 + | {-a, +b|0, 0)| 2 = cos 2 ^ (5.48) 

2 

Clearly the probability would be the same if A’s SG device is oriented along b and 
B’s device is oriented along a. Now Jet’s choose the axes a, b, and c, as shown 
in Fig. 5.6. With the angle 9 ab = 120°, the probability (5.48) is simply But the 
probability 


|(+a, — cjO, 0)| 2 + | {-a, +c|0, 0)| 2 = cos 2 ^ (5.49) 

2 

is also equal to | since the angle 9 ac = 120°. In fact, since the angle d bc = 120° as 
well, quantum mechanics predicts for the particular orientation of the axes shown in 
Fig. 5.6 that exactly one-quarter of the measurements will yield values with opposite 
signs for the spins along different axes, in direct disagreement with the model in 
which each of the particles possesses definite attributes, where at least one-third of 
the measurements yield values with opposite signs. Thus it should be possible to 
test which is right—quantum mechanics or a model in which the particles possess 
definite attributes—by performing an experiment. 



Figure 5.6 One orientation of the axes a, b, and c that 
leads to disagreement between quantum mechanics and a 
model in which the particles possess definite attributes. 
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Interestingly, we can extract a variety of inequalities from the supposition that 
the particles can be grouped into populations of the type (5.41), inequalities that 
may be easier to test in practice than the experiment that we have just described. 
Without having to specify the relative populations of the different groups (5.41), we 
may quickly see, for example, that certain inequalities such as 


+ N 4 < (N 2 + JV 4 ) + (W 3 + N 7 ) (5.50) 

must hold. But 

^W (+a;+b) (5.51) 

is the probability that a measurement by A yields S la = ft/2 for particle 1 and a 
measurement by B yields Sih = ft/2 for particle 2. Only populations iV 3 and N 4 
contain particle types satisfying both these conditions. Similarly, 


N 2 + Aft 
£«• Nt 


P(+ a; +c) 


(5.52) 


is the probability that a measurement by A yields .Sj u = ft/2 for particle 1 and a 
measurement by B yields S 2r = ft/2. Also 


N 3 + N 7 

£, Ni 


= P(+ c;+b) 


Thus the inequality (5.50) may be expressed as 


(5.53) 


P(+ a; +b) < P(+a; +c) + P(+ c; +b) (5.54) 


which is known as a Bell’s inequality. In order to test this inequality, A and B 
just make measurements to determine the three probabilities. First A's SG device is 
oriented along a, while B’s is fixed along b, and measurements are made to determine 
P(+ a; +b). A and B then go on to measure P(+a; +c) and P(+c; +b). 

The inequality (5.54) is in a form that is easy to compare with the predictions of 
quantum mechanics. In particular, since 


(+a, +b|0, 0) = — (+a, +b|+a, -a)-~{+a, +b|-a, +a) 

V2 V2 


1 

V2 


and 


(+b|—a) 


(+b|—a> = sin 


(5.55) 


(5.56) 


the prediction of quantum mechanics for the probability is given by 


P(+a;+b) = |(+a,+b|0, 0)| 


1 -2 @ab 

- sin — 

2 2 


(5.57) 
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Figure 5.7 An orientation of the axes a, b. and c where c bisects the 
angle between a and b. 


Note that if b = a, then ®ah = and P(+ a; +a) = 0, as it must for two particles in 
a state with total-spin 0. Also, if b = —a, then 0 ab = n and P(+a; —a) = again 
the usual result for a total-spin-0 state. If we generalize the result (5.57) to the other 
two terms in the Bell’s inequality (5.54), we obtain 

sin 2 — < sin 2 — + sin 2 — (5.58) 

2 2 2 

As in our earlier discussion, this inequality is violated for certain orientations of a, 
b, and c. To see the disagreement in a particular case and to make the algebra easy, 
let’s take the special case where c bisects the angle 0 ah , as shown in Fig. 5,7. If we 
call 9 ab — 20, then 6 ac = 6 bc = 0, and the inequality (5.58) becomes 


, , 0 
sin" 0 < 2 sin 2 — 

2 

In particular, let 0 = n/3 = 60° as a specific example; we then obtain 


(5.59) 



(5.60) 


again a marked disagreement between the predictions of quantum mechanics and 
those of a local, realistic theory. In fact, this particular choice of angles is the same 
as in our earlier discussion. Just let c ->• — c to go from Fig. 5.6 to Fig. 5.7. Then 
spin-down along c is spin-up along -c. As Fig. 5.8 shows, (5.58) is violated for all 
angles 0 satisfying 0 < © < n/2. Thus it should be possible to test the predictions of 
quantum mechanics by observing the correlations in the spins of the two particles for 
a variety of angles. Based on our earlier discussion, if quantum mechanics is correct 
and Bell’s inequality is violated, no local hidden-variable theory can be valid. 


EXPERIMENTAL TESTS AND IMPLICATIONS 

Bell’s results have inspired a number of experiments. With the exception of one 
experiment that measured the spin orientation of protons in a singlet state, these 
experiments have all been carried out on the polarization state of pairs of photons 
rather than on spin-) particles. Suitable optical photons are produced in the cascade 
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Figure 5.8 (a) The unit vectors a, b, and c specifying the orientation of three SG 
devices for measuring the spins of the two spin-i particles emitted in a total-spin-0 
state Each of the SG devices has its measurement axis transverse to the direction ot 
flight of the two particles, and therefore the unit vectors all lie in a plane with their 
tips on a circle. Note that the square of the length of the vector pointing between a 
and b is given bv |a - bf - a 2 + b 2 ^2a ■ b = 2(1 - cos 0 ab ) = 4 s,n 2 6 ah /2. Similarly. 
, a _ C |2 = 4 sin “ 0 /2 and |b - c| 2 = 4 sin 2 6 bc /2. Thus, expressed in terms of these 
lengths, the inequality (5.58) becomes |a - b| 2 < |a - c| 2 + |c - b| . (b) The angle 6 ab 
is taken to be jr, in which case the triangle formed by |a - b|, |a - c|, and |c b| is a 
right triangle and therefore |a - b| 2 = |a - c| 2 + |c - b| 2 . Note that in (b), (c) and (d), 
the vectors a. b, and c are not actually shown, but you can see their direction by noting 
the points where they intersect the unit circle, (c) The angle 6 ab < n and since thc angle 
q > jr/2 |a _ b| 2 > |a - c| 2 + |c - b| 2 and the Bell inequality (5.58) is violated, (d) The 
angle 0 a ' b > tr, making 0 < rr/2 and |a - b| 2 < |a - c| 2 + |c - b| 2 , in accord with the 
Bell inequality. 
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Figure 5.9 Correlation of polarizations as a function 
of the relative angle of the polarimeters. The indicated 
errors are ±2 standard deviations. The dotted curve is 
not a fit to the data, hut the quantum mechanical predic¬ 
tions for the actual experiment. See Problem 5.10. For 
ideal polarizers, the curve would reach the values ±1. 
Adapted from A. Aspect, P. Grangier. and G. Roger, 
Phys. Rev. Lett. 49, 91 (1982). 


decays of atoms such as Ca or Hg excited by laser pumping in which the transition 
is of the form 


(/ = 0) A (J = 1)-I> (J = 0) 

and the photons are emitted essentially back to back in the state 

W ] = -^\ R - R) + ~\LL) (5.61) 

The correlations are between measurements of the linear polarization for each of 
the photons. The most precise experiments of this type have been carried out by 
A. Aspect et al. in 1982. In one case the Bell inequality was violated by more than 
nine standard deviations. On the other hand, the agreement with the predictions of 
quantum mechanics is excellent, as shown in Fig. 5.9. More recently, the technology 
of making these measurements has improved significantly through the use of spon¬ 
taneous parametric down-conversion (SPDC), a process in which a single photon 
splits into a pair of polarization-entangled photons through interaction in a nonlin¬ 
ear crystal. Using SPDC, P. Kwiat et al. obtained a violation of a Bell’s inequality 
by 242 standard deviations in less than three minutes of data taking. 11 

These results do not make the local realist happy. One of the disturbing features 
of these results to the local realist (and, perhaps, to you too) is understanding how A’s 
measurements on particle 1 can instantaneously fix the result of B’s measurement 
on particle 2 when the two measuring devices may be separated by arbitrarily large 
distances. In the experiments of Aspect et al.. the separation between these devices 
was as large as 13 m. Although we do not have any mechanism in mind for how 
the setting of A’s measuring device could influence B’s device, in these experiments 
the devices are left in particular settings for extended periods of time. Maybe B’s 
device knows about the setting of A's device in ways we don’t understand. In 
order to eliminate the possibility of any influence. Aspect et al. have carried out 
one experiment in which the choice of analyzer setting was changed so rapidly that 


11 P. Kwiat, E. Waks, A. White, I. Appelbaurn, and P. Eberhard, Phys. Rev. A 60, R773 (1999). 
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A’s decision on what to measure could not have influenced B’s result unless the 
information about the choice of setting was transmitted between A and B with a 
speed faster than the speed of light. 12 Even in this case the quantum mechanical 
correlations between the measurements persisted. Strange as these correlations may 
seem, they do not permit the possibility of faster-than-light communication. In the 
spin system, for example, 50 percent of B’s measurements of S z yield S 2z = h/2 
and 50 percent yield S 2 z = —h/2 whether or not A has made a measurement and no 
matter what the orientation of A's SG device (see Example 5.1). It is only when A 
and B compare their data after the experiment that they find a complete correlation 
between their results when they both oriented their SG devices in the same direction. 

So where does all this leave us? Certainly with a sense of wonder about the way 
the physical world operates. It is hard to guess how Einstein would have responded 
to the recent experimental results. As we have noted, he believed particles should 
have definite attributes, or properties, independent of whether or not these properties 
were actually measured. As A. Pais recounts: “We often discussed his notions on 
objective reality. I recall that during one walk Einstein suddenly stopped, turned to 
me and asked whether I really believed that the moon exists only when 1 look at it.” 13 
In the microscopic world, the answer appears to be yes. 

5.6 Entanglement and Quantum Teleportation 


In science fiction, teleportation is the feat of making an object disappear in one 
place and reappear (perhaps instantaneously) somewhere else. It is unclear, of course, 
how this process is supposed to work. It is, after all, science fiction. Apparently, the 
object being teleported is scanned in some way (and destroyed) and a replica of the 
object is reassembled at another location. Although science fiction typically focuses 
on teleporting a macroscopic object, e.g. Captain Kirk, it is fair to ask whether 
teleportation is possible on a microscopic scale. 

If we imagine trying to teleportjthe state of a single spin-1 particle, we face 
a daunting challenge when it comes to scanning the state | xjf) = a\+z) + b\— z). 
Determining the probabilities \a\ 2 and \b\ 2 that the particle has S z = h/2 or 
S z = —h/2 requires repeated measurements of S z on an ensemble of particles each 
in the state |t/r), with each measurement collapsing the state \xj/) to the state |+z) or 
| —z), respectively. And these measurements would not tell us the relative phase of 
the amplitudes a and h, which is also needed to reconstruct the state. Determination 
of this phase would require measurement of an additional quantity such as (S x ) or 
(5 V ). One possible way out might be to clone many copies of the original state and 
make the repeated measurements on these copies. But as Example 5.2 at the end of 


12 A. Aspect, .1. Dalibard, and G. Roger, Phys. Rev. Lett. 49, 1804 (1982). 

13 A. Pais, Rev. Mod. Phys. 51, 863 (1979). 
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this section shows, cloning is not possible in quantum mechanics. Therefore, scan¬ 
ning the original quantum state to obtain the information that must be teleported 
cannot be done. Thus it was a surprise when, in 1993, Bennett et al. pointed out that 
teleportation of the state \\fr) is possible provided the information contained in the 
state is not actually determined. 14 

Let’s call the spin-1 particle whose state we wish to teleport particle 1. We will 
call the person sending the state Alice and the person receiving the state Bob. 15 Alice 
could of course just send Bob the particle itself. But this can take time, especially if 
Alice and Bob are far apart. Moreover, it may be difficult to maintain particle 1 in the 
state |i Jr) during the transmission process. There may be interactions, such as stray 
magnetic fields, that cause the relative phase between the spin-up and spin-down 
states to change. The strategy for teleportation is to start with two other spin-1 par¬ 
ticles, particles 2 and 3, that are entangled in the total-spin-0 state (5.31) that has been 
the focus of our discussion of the Einstein-Podolsky-Rosen paradox. Assume that 
particles 2 and 3 travel outward from the location in which they are put in this total- 
spin-0 state to Alice and Bob, respectively. See Fig. 5.10. When particle 2 reaches 
Alice, she performs a measurement that entangles particle 1 and particle 2 together, 
potentially passing, as we will show, the information contained in particle 1 instan¬ 
taneously to particle 3, which has never been in contact with particle 1. Since the 
state |i//) of particle 1 is destroyed in this process, it is appropriate to call the process 
teleportation (and not replication). However, in order for Bob to maneuver particle 
3 into exactly the same state as particle 1 was in initially, he needs to know the result 
of Alice’s measurement, which is sent through an ordinary classical channel, say by 
telephone or email. Strange as it may seem, particle 2, the intermediary in this tele¬ 
portation process, interacts first with particle 3 and then with particle 1, even though 
you probably would have thought that to convey the information in particle 1 to 
particle 3, particle 2 should interact with particle 1 before it interacts with particle 3. 

In this description of the teleportation process, we have used the word entangled 
a couple of times. The concept of entanglement has a precise definition in quantum 
mechanics. 16 Let’s first look at the total-spin-0 state of two spin-j particles 


|0,0> = |< , ) = ~|+z)2|-z)3-^= 


-Z)2l+Z> 


(5.62) 


14 C. H. Bennett, B. Brassard. C. Crepeau, R. Joza, A. Peres, and W. K. Wooters, Phvs. Rev. 
Lett. IQ. 1895 (1993). 

15 In the field of quantum cryptography [see V. Scarani, Rev. Mod. Phys. 81 , 1301 (2009)], the 
person who might be trying to intercept the message, or the quantum state, is typically called Eve. 

16 It was Schrodinger who first introduced the concept of entanglement in a paper he wrote in 
1935. following up on the EPR paper. Nonetheless, the term entanglement was not widely used 
until the early 1990s, when articles like the one by Bennett et al. on quantum teleportation led 
to the realization that quantum entanglement was an important resource that could be utilized in 
novel ways. 
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Figure 5.10 In quantum teleportation, particles 2 and 
3 are entangled in an EPR pair, such as (5.62). Alice 
performs a Bell-state measurement entangling particle 1, 
which was initially in the state |i/r), and particle 2. This 
measurement destroys the state |i/r). Alice then sends the 
result of her measurement to Bob, who performs a unitary 
transformation (a spin rotation in the example discussed 
in this section) transforming particle 3 into the state |i jr). 


where the subscripts 2 and 3 label the single-particle states of the two particles in the 
superposition. In addition to specifying this state as the total-spin-0 state |0, 0), we 
have also labeled this state as d't,' 1 ) for reasons that will be apparent shortly. We 
say |' 1 ^ 3 ) ) is an entangled state because the state cannot be factored into the product 
of two single-particle states. That is, it is not a state of the form 

|fl} 2 ®|*>3=|a>#>3 (5 ’ 63) 


The state of particle 1, the particle Alice wishes to teleport, is simply 

=a\+z)i + b\-z)i ( 5 - 64 ) 

t 

.- 5 *- 

where we have added a subscript to the kets to emphasize that these kets refer to 
the single-particle state of particle 1. Before Alice makes a measurement, the three- 
particle state is 

\1r m ) = («l+z)i + b\-zh) (-^l+z} 2 l-z>3 - ^ 1-Z)2l+Z>3 ) (5 ' 65) 
which can be rewritten as 


| f m ) = —}= (H-z) 1 |+z) 2 |—z ) 3 - l+z)jl— z> 2 H-z) 3 ) 

V 2 

+ -^= (l —Z)i|+Z>21 z) 3 - |- z )l|- z )2l+ z )3) 

V 2 


(5.66) 
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Notice that while particles 2 and 3 are entangled, particle 1 is not entangled with the 
other two particles. The state |i/r 123 ) is just a direct product of the state j i/r,} and the 
state 1^23 ’>, that is \f U3 ) = I7t) ® I'^V 

The key step to teleportation is for Alice to make a special type of measurement 
called a Bell-state measurement that projects | t/r 123 ) onto the Bell basis, a complete 
set of states for which each state in the basis entangles particles 1 and 2. Two of the 
Bell basis states are defined to be 

I*!?) = ^l+z>il - z >2 ± -^|-z),l+z> 2 (5.67) 


As we have noted, the state I'hp *) is the total-spin-0 state |0, 0) [as in (5.62), but 
here for particles 1 and 2 instead of particles 2 and 3]. The state is the total- 

spin-1 state 11, 0). In order to span the space of two spin-| particles, we need two 
additional basis states. We choose 


= -^I+Z}ll+Z) 2 ± -4?|- Z >l|- Z >2 


V2' 


72 


(5.68) 


Unlike the states Ifi'ju’) and |'I' I < + ) ), the states and lOp*} are not total- 

spin eigenstates. They are linear combinations of the states 11, 1) = |+z> 1 |+z ) 2 and 
11 , — 1 ) = |—z) 1 |-z) 2 , linear combinations that entangle the two particles. 
Expressed in terms of these Bell basis states, the state 1 1 /^ 23 } becomes 

17)23) — “ 1^12 *) ( zz| —f-z )3 — b\— z) 3 ) + -|'hj 2 ) ) (—n|+z ) 3 + b\— z) 3 ) 

+ ( 6 |+z >3 + tf|-z} 3 ) + (-*|+z > 3 + fl|-z) 3 ) (5.69) 


Each state in the superposition occurs with probability (1/2 ) 2 = 1/4 if a Bell-state 
measurement is carried out. If Alice’s Bell-state measurement on particles 1 and 2 
collapses the two-particle state to the state I'l'Jj *), for example, then particle 3, Bob’s 
particle, is forced to be in the state |y> 3 ) = —a|+z ) 3 - £>|—z) 3 , which is exactly the 
state j \jr), up to an overall phase, of particle 1 before the measurement. Thus in 
this case, we can say that Alice has instantaneously teleported the quantum state of 
particle 1 to Bob. This is a dramatic illustration of the “spooky action at a distance” 
of entangled states that so troubled Einstein. Notice that as a result of this Bell- 
state measurement, particle 1 has become entangled with particle 2. In this case, 
these particles are in the state which shows no vestige of the state | \j/ x ). The 

original state is destroyed during teleportation. 

But of course, Alice's Bell-state measurements also yield particles 1 and 2 in 
the states |4>|7), I'tp’), and each with 25 percent probability. Even in these 

cases, if Alice sends Bob a message through a classical channel containing the result 
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of her measurement on particles 1 and 2, then Bob can perform an operation on his 
particle that will put it into the state |i/r). For example, if Alice tells Bob that her 
measurement yielded the state then Bob, whose particle then must be in the 

state — b\+z) + a|—z), need only rotate the spin state of his particle by 180° about 
the y axis to turn the state of his particle into the state particle 1 was in initially. See 
Example 5.3. 

As you may have noted, teleportation occurs instantaneously 25 percent of 
the time, which may cause you to worry about the possibility of superluminal 
communication. However, Alice’s classical message plays a crucial role. As Bennett 
et al. point out, if Bob becomes impatient and tries to complete teleportation by 
guessing Alice’s message before it arrives, then his state \ifrf) will be a random 
mixture of the four states — a|+z) 3 — b\— z> 3 , —a|+z) 3 + b\— z} 3 , h|+z) 3 + a\— z) 3 , 
and — /?|+z} 3 + a\— z) 3 , which is shown in the next section to give no information 
about the input state |i//}. 

Teleporting even the simplest quantum state such as the spin state of a spin - 3 
particle or the polarization state of a photon is not easy to accomplish in practice . 17 
Thus extension of these techniques to teleporting a macroscopic object such as a 
person is not likely to be feasible in anything like the foreseeable future. What 
makes quantum teleportation of special interest at this point is its role in high¬ 
lighting the seemingly mysterious nature and potential of entanglement in quantum 
mechanics . 18 


EXAMPLE 5.2 At the beginning of this section it was noted that it is 
impossible to clone, or copy, a quantum state. Here you are asked to prove 
the no-cloning theorem of quantum mechanics. To get started, note that 
cloning a quantum state requires a unitary operator U, a time-development 
operator if you will, that acts on a two-particle state |i/r)i|e) 2 producing the 
two-particle state |V r )ill/ r }^ that is 

U\ir)i\e ) 2 = \f)i\f ) 2 

Here we are presuming that the initial, or blank, state \e) is independent of 
the state \fj) that we want to clone, a state for which we presume we have 


17 For example, it should be noted that it is not possible to make a complete set of Bell-state 
measurements with experimental apparatus that utilizes only linear elements. See L. Vaidman and 
N. Yoran, Pkys. Rev. A 59, 116 (1999). 

18 Going forward, if efforts to develop a quantum computer are ever to be successful, quantum 
entanglement will play a key role. The field of quantum computation is still in its infancy. The 
interested reader is referred to M. A. Nielsen and I. L. Chuang, Quantum Computation and 
Quantum Information , Cambridge University Press, Cambridge, UK, 2000. 
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no prior knowledge. Consequently, the operator U must also clone the state 
\(p), namely 

f^Wikh = \<P)\\¥h 

Use the fact that U is unitary to show that cloning can occur only if 

(<p\f) = {<p \t) 2 

which is not generally true. 

SOLUTION Start by taking the inner product of the two initial states 
W)i\e)% and W) l \e) 2 

(l^bkl) (l^)ik>2) = (<PW 

since (e\e) = 1. If we now insert U^U, which is one since U is unitary, 
between the two-particle bra and two-particle ket states in the left-hand side 
of this equation, we obtain 

](<p\ 2 (e\U t U\f} l \e} t = (<p\f) 

Since U |^) 1 |e> 2 = [tA)iI 1 A }2 an d we end up with the 

condition that 

(<p\f) 2 = {<p\f) 

The requirement that x 2 = x says that either x = 1 or a = (), corresponding 
to |t If) — | cp) or {(p\i/) = 0, neither of which is true for arbitrary \ f) and \<p), 
thereby establishing the no-cloning theorem. Thus it is impossible to make an 
ensemble of particles each of which is in the same quantum state by maki ng 
copies of the quantum state of a single particle. 


EXAMPLE 5.3 Show that if Alice’s Bell-state measurement yields parti¬ 
cles 1 and 2 in the state | «J> ), then Bob can put his particle into the state 

particle 1 was in initially by rotating the spin state of this particle by 180° 
about the y axis. 

SOLUTION If Alice’s Bell-state measurement yields the state |4>j| ) ), then 
from (5.69) we see that Bob’s particle must be in the state —6|+z) + a\— z). 
Recall from Problem 3.5 that the operator that rotates spin-j states by angle 
0 about the y axis is 

R(0 j) = e-‘Sy e/fi = cos - - ~S V sin - 
2 h y 2 


Page 186 (metric system) 




5.7 The Density Operator | 171 


or in matrix terms in the .S'. basis 


m j) 


cos 


sinf 


S, basis \ sin | cos | 


Thus using matrix mechanics, again in the S, basis, 

— (° 

S 2 basis \1 0 / V a 


^Crj) (—b\+z)i + a\— z)j) 


which is the state hA) up to an overall phase. It is not hard to figure out which 
180° rotations Bob must perform if Alice’s measurement yields j'pjl') or 
|4>i,'). See Problem 5.14 and Problem 5.15. 


5.7 The Density Operator 


Let’s return to the Stem-Gerlach experiments of Chapter 1. You may recall that the 
starting point in the original Stem-Gerlach experiment was an ensemble of spin-j 
silver atoms that emerged in the form of a gas from a hole in an oven. We say that these 
atoms are unpolarized since there is no physical reason for the spin of an individual 
atom to “point” in any particular direction. The atoms were then sent through a Stem- 
Gerlach device, say one in which the gradient in its inhomogeneous magnetic field 
was oriented along the 2 axis, to select atoms that exited the device in either the state 
|+z) with S z = h/2 or the state |— z) with S z = —h/2. States such as |+z) and |—z} 
are typically referred to as pure states. But from the experimentalist’s perspective 
such states are really an idealization, since in practice it is not possible to construct an 
SG device for which the gradient is large along just one of the continuum of potential 
directions in which it may point. For example, the atoms that we characterized as 
exiting an SGx device in the state |+x) with S x = h/2 will inevitably exit with some 
range of azimuthal angles, presumably centered about 0 = 0. Thus heating the atoms 
in an oven or passing them through an SG device produces an ensemble of particles in 
a statistical mixture of pure states, a mixture that is termed a mixed state. So far our 
quantum mechanics formalism has focused exclusively on pure states. A systematic 
way to handle mixed states as well as pure states is provided by the density operator. 


DENSITY OPERATOR FOR A PURE STATE 

For a pure state \xj/), the density operator is given by 

P = \ (5.70) 

The matrix elements of the density operator are then given by 

Pii = m)bk\j) (5.71) 
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Consequently 


P*j = (j\f)(f\i) =Pji (5-72) 

which is the condition satisfied by the matrix elements for a Hermitian operator. We 
define the trace of an operator as the sum of the diagonal matrix elements: 

trp = E Pa = = I (5-73) 

i i i 

where in the penultimate step we have taken advantage of the completeness of the 
states | i). Alternatively, you can write 

J]<tA|/){/|^) = ^|(t|^)| 2 = 1 (5.74) 

i i 

which also follows from the completeness of the basis states. Note that 

p 2 = = \f)W\ = P (5.75) 

Thus tr p 2 = 1 for a pure state. 

The density operator (5.70) is the projection operator for the state 1 1 fr). Using the 
projection operator 


P\*) = 10) (4>\ (5.76) 

for the state | <j>), we see that 

tr(P| 0) p) = 

i 

= X>iv r >wi*)<w> 

i 

= (<t> \f)(fW 

= \(m)\ 2 (-5.77) 

is the probability that a measurement yields the state \<j>). And the expectation value 
for an observable A in terms of the density operator is given by 

(a> = mm) = E<^ \ i )d\Mj)u\i') = E A uPn= tr ^ ( 5 - 78 > 

ij i.j 

Finally, we can deduce the time evolution of the density operator from the 
Schrddinger equation: 
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~P (0 = <VH0I + hK0> 

= ^-H\xl/(t))(f(t)\ + -^—mt))m)\H 

ih (-in) 

= ±[H, p(t)] (5-79) 

in 


in^p(t) = {H,pm 

dt 


(5.80) 




EXAMPLE 5.4 Use the density operator to determine (S y ) for the pure 
states (a) |+z) and (b) |+y). 


SOLUTION 

(a) 

Since 


1 0 

P — i+ z X+ z l * . n „ 

S z basis \\J 0 


^ n ( ° -i 


therefore 

(S y ) = tr(S v p) = tr 


y S z basis 2 V i 0 

n / o -i' w i o 
2 \ i 0 ) V 0 0 


= tr 


h /o o 


2 Vi o 


= 0 


consistent with the fact that a measurement of S y on a spin-| particle in 
the state |+z) yields h/2 apd -/i/2, each with 50 percent probability. 


(b) p = |+y)(+y| 


= 4(| + z> + i|-z))-L«+z|-/(- z l) T -> )■( 

y/2 y/2 s, basis 2 V' 1 


therefore 


(S y ) = xr(S v p) = tr 


h / 0 -i \ 1 / 1 -i 

2 V i 0 ) 2 V i 1 


h ( 1 -i \ h 

= tr - 

4 Vi 1 


as it must since the state |+y) is an eigenstate of S y with eigenvalue 
h/2. Notice how the relative phase between the states |+z) and |—z) 
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shows up in the off-diagonal matrix elements of the density matrix in 
the S z basis. We could, of course, have done this calculation simply in 
the S v basis, but it would not have been as instructive. 


DENSITY OPERATOR FOR A MIXED STATE 


For a mixed state, one for which p k is the probability that a particle is in the state 
then 


where 


Thus 


P = Y p k W (k) )(f kk) \ ( 5 . 81 ) 

k 


Y Pk = 1 ( 5 . 82 ) 


= ( 5 . 83 ) 

k 


and the trace of the density operator for a mixed state is 


»^EE Pkii\f {k) ){f (k) \i) = Y p k Y^ (k) W$l^ (fe) ) = Y P k = 1 

i k k i k 

( 5 . 84 ) 

Since the density matrix is Hermitian (p tj = p*.), the density matrix can always be 
diagonalized with diagonal matrix elements given by the probabilities p k . Thus 


tr P 2 = Y p l S 1 (5-85) 

k 

The trace of fr is equal to one only for a pure state, since in that case there is a 
single nonzero p k — 1, which means pf = 1 as well. Thus tr p 2 < 1 is a telltale 
indication of a mixed state. For a mixed state, the expectation value, really an average 
of expectation values, for an observable A is given by 


<A)= Yp^ m \ A \f (k ^ 

k 

= Ypk(f (k) \n(i\Mj)u\f ik} ) 

U.k 

= Y l ~‘ * Y PkUw ik) ){f <k) \>) 

i,j k 

= Y A ‘Jpj> = tr (vTp) (5.86) 

ij 
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Figure 5.11 A mechanism for generating an ensemble of atoms in the mixed state described 
in part (b) of Example 5.5 under the presumption that the holding chamber consists of 50% 
in the state |+z) and 50% in the state f —x). 


It is straightforward to show that the probability that a measurement for a mixed state 
yields the state \<p) is given by (5.77) and the time evolution of the density operator 
for a mixed state obeys (5.80). 




EXAMPLE 5.5 Consider the density operators (a) ^|+z)(+z| + ^| — z}{—z'| 
and (b) ||+z)(+z| + ^| —x>(—x|. Construct the corresponding density ma¬ 
trices in the S z basis and show that these are density operators for mixed 
states. Calculate (S x ) for each state. 

We will see that the density operator in (a) can be used to characterize 
the unpolarized ensemble of silver atoms that exit the oven hr the original 
Stern-Gerlach experiment (see Section 1.1), whereas the density operator in 
(b) might characterize an ensemble that is generated by a 50-50 mix of atoms 
that exit an SGz device with S z = h/2 and an SGx device with S x = —hj 2, 
as illustrated in Fig. 5.11. 

SOLUTION 


(a) 


1 1 1/1 0 
P = i l+zH+,| + i l^)M - =ri ( 0 


Note that 


1/10 


S z basis 4 \ 0 1 


and thus tr p 2 — signifying a mixed state. 


{S x ) = tr (5/p) = tr 


h( 0 1 \ I / 1 0 
2 V 1 0 / 2 V 0 1 


H /0 1 

= tr — ( 1=0 

4 V 1 0, 


which is what we would expect for an ensemble of particles half of 
which have S , = h/2 and half of which have S. = —h/2. 
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P = ^l+z)(+z| + 2 x)( xl 

= ^l+z){+z| + ^(|+z) - |- z ))((+ z l “ { 'll) 

Therefore 


°) + if 1 = -') 

s z basis 2 \ 0 0/ 4 \ — 1 1/ 4 V — 1 1/ 

Consequently, 



S z basis 


l 

8 


5 

-2 


-2 

1 


for which the trace is | | again signifying a mixed state. 


($ x ) = tr(5 l/ o) = tr 


: tr 



which is what you would expect for an ensemble of particles for which 
half are in the state |+z), which has {S x ) = 0, and half are in the state 
t —x>, which has (S x ) = — h/2. 

Caution: Although we have argued that our results for { S x ) are 
consistent with an ensemble of spin-^ particles with a certain fraction 
of the particles in one pure state and a certain fraction in another pure 
state, you cannot infer from the form of the density operator for a 
mixed state that the ensemble is indeed in this particular mixture of 
pure states. For example, take the density operator 

P = ^|+X)(+x| + ~ | x) { x| 

Expressing this operator in terms of the |+z)-|—z) basis states, we see 
that 


P = j(I+z) + |-z))«+z| + <-z|) + i(l+z) - |-z»({+z| - { z|) 

*4 4 

= ^l+z)(+z| + “! P){ zj 

namely, the same density operator that we examined in part (a) and 
characterized as an ensemble with half the particles having S z = H/2 
and half having S z = —h/2. Now we see that we could just as eas¬ 
ily characterize this state as a mixture with half the particles having 
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S x — h/2 and half having S x = —H/2. It can also be shown (see Prob¬ 
lem 5.19) that the density operator ^|+z){+zi + ||—z}{—z| is the 
same as the density operator 

p = ^|+n){+n| + ^|—n}<—n| 

Thus this mixed state can be viewed as having half the particles with 
S„ = h/2 and half with S n = —h/2 for any direction n. Consequently, 
the density operator ||+z)(+z| + J;|-z)(-z| characterizes a com¬ 
pletely unpolarized collection ofspin-| particles. 

This example illustrates an important point. Seemingly different mixed 
states that correspond to the same density operator are really the same 
quantum state, since the expectation value of every observable is the same 
for each of these ensembles. 


EXAMPLE 5.6 The spin Hamiltonian for a spin-{ particle in a magnetic 
field B = B k is 

H = —ji B = —fi : B 

where 


/C = 


8 e , 
2m c 


for a particle with charge q = —e. Use the density operator for an ensemble 
of N of these particles in thermal equilibrium at temperature T to show that 
the magnetization M (the average magnetic dipole moment of the ensemble) 
is given by ^ 


M = N(fi z ) — Nil tanh 


liB 

klf 


where k B is the Boltzmann constant and /r = geh/Amc. As noted earlier, for 

I 

an electron g is almost exactly 2, and therefore q = eh/2m e c = q B , where 
/r B is called the Bohr magneton. 

SOLUTION First note that 

H |±z) = ——B |±z) = ik(iB |±z) 

4 me 

Therefore the energies of the states |+z) and | —z) are /iB and — jiB , re¬ 
spectively. The relative probabilities that the particles in the ensemble are in 
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the states |+z) and |—z) are dictated by the Boltzmann factor e . The 
density operator is given by 


where 


p—H B/kftT ptxBIk^T 

---|+z)(+z| +---| 


2 = _|_ e nB/k B T 


-z)(-z| 


The corresponding density matrix is 


S z basis Z 


-flB/kftT 


„liB/k B T 


We have inserted the 1/Z factor in the density operator so that the trace of 
the density matrix is one, which is equivalent to ensuring the probabilities of 
being in the states |+z) and [—z) sum to one. (In statistical mechanics, Z is 
called the partition function.) Then 


M = N{fi z ) = N tr (p/i z ) 


= -yv 


SiUtr 

4 me) Z 




glxB/kftT 


0 -1 


= HtL {e BB/kvT _ e -ixB/k B Tj 


= N/JL 


e nB/k B T _ e -ixB/k B T 
e nB/k B T _|_ g-fiB/k B T 


= Nn tanh -— 
k h T 

An interesting limiting case occurs when /iB/k^T 1. Then we can 
approximate the magnetization by 

k n T 

since tanh x ~ x when x « 1. This 1/7’ dependence of the magnetization for 
a paramagnetic system (one in which the particles have a permanent magnetic 
dipole moment) is referred to as Curie’s law, since it was first discovered 
experimentally by Pierre Curie. Curie’s law is often expressed in the form 


M — C- 


where C is called the Curie constant. The value of C varies with the spin of 
the particle. For spin-1 particles, see Problem 5.22. 
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MULTIPARTICLE SYSTEMS 

Let’s start with the two-particle pure state 

\f n ) = |+z),|+z> 2 (5 ' 87) 


The density operator is 

P = I = (l+z)il+z)2)(t(+ z l2(+ z l) ^ 5 - 88) 

To determine the expectation value of the z component of the spin angular momen¬ 
tum for particle 1 we calculate 

(S l2 ) = tr(S lz p) (5 ’ 89) 

where by S u we really mean S lz ® 1, the direct product of S lz and the identity 
operator for particle 2. Thus in calculating (S lz ) we are effectively using something 
called the reduced density operator p (l) , which is obtained from the full density 
operator by tracing over the diagonal matrix elements of particle 2 (since the particle- 
2 operator is the identity operator): 

P (I > = £20«>2 (5 - 90) 

j 


Thus for the pure state (5.87) 

p = (|+z) 1 |+z) 2 )(i(+ z l2(+ z l) P (l) = (|+Z) 1 )(,(+Z|) (5.91) 

For the entangled two-particle state 

Uhi) = ^l+^tl- z >2 - -^=|— z >iH-z>2 ( 5 ' 92) 

that was the focus of our discussion of the Bell inequalities in Section 5.5, the density 
operator is given by 

p = i (|+Z) 1 |-Z) 2 - |-Z) 1 |+Z) 2 ) (ji+zbi-zl - !<— Z| 2 (+z|) (5.93) 

The corresponding density matrix using the states (5.8) as a basis is 

0 0 0 

0 1 -1 

0 -1 1 

0 0 0 



0 \ 

0 

0 

0/ 


(5.94) 


Page 195 (metric system) 



180 | 5. A System of Two Spin-1/2 Particles 


The local realist would prefer to think that the particles in the state (5.92) are in a 
50-50 mix of the two states |+z) 1 |-z) 2 and |-z),|+z) 2 , for which the corresponding 
density operator is 


p = \ (l+zhl-zh) (i<+z| 2 (“Z|) + ~ (|-z>,]+z> 2 ) ( 1 (-z| 2 <+z|) (5.95) 
and the corresponding density matrix is 


(o 

1 ° 

2 0 

u 


0 

1 

0 

0 


0 

0 

1 

0 


o\ 

0 

0 

<v 


(5.96) 


Comparing the density matrices (5.96) and (5.94), we see that the true superposition 
embodied in the pure state (5.92) reveals itself in the density operator formalism in 
the presence of off-diagonal matrix elements in the density matrix, thus making it 
easy to distinguish an entangled state from a mixed state. 

Moreover, by tracing over the diagonal matrix elements for particle two in the 
two-particle matrix (5.94) for the entangled state (5.92), we obtain the reduced 
density operator 


P <n = - (l+z)i) (i(+z|) + ~ (|-zh) (i(-z|) (5.97) 

Thus for measurements solely of particle 1 the pure state (5.92) behaves like the 
mixed state of a completely unpolarized ensemble. This is another way of showing 
that measurements made only on a single particle that is part of the two-particle 
entangled state (5.92) do not contain any information about the other particle. This 
result also illustrates one of the hallmarks of quantum entanglement—namely, the 
reduced density operator for a single particle that is entangled with another particle 
in a pure state is a mixed state. 

Finally, take a look at the pure three-particle state | i/r l23 ) that was the centerpiece 
of our discussion of quantum teleportation in the previous section [see (5.66)]. The 
corresponding density operator for this pure state is given by 

P = 1 ^ 123 } (V r i23! (5.98) 

Alter Alice carries out her Bell-state measurement, the density matrix for an ensem¬ 
ble of particles in this three-particle system is the mixed state 
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(-a|+z} 3 -&|-z> 3 ) (vpg’l (-a*i(+z\ - b* % (-z\) 

+ "I'J'ia ! > ( — ■M+ z ).3 + b\—z)^ {^[2 'l (■“ fl *3(+ z l + b*i(~ z l) 

+ “1^12 *) (^l+ z )3 + d\~ z h) (*^12 *1 (^*3(+ z l + a *l(~ z l) 

+ (“^l + z )3 + a \— z) 3 ) (— b* i {+z\ + a* 3 (—z|) (5.99) 

The reduced density matrix that describes the results of measurements made by Bob 
if he chooses not to wait for the information from Alice telling him the result of her 
measurement is obtained by tracing over particles 1 and 2, leading to 

P (M = ~ (Ml 2 + Ml 2 ) (|+z) 3 ) ( 3 {+z|) + ^ (|a| 2 + M| 2 ) (|- z > 3 ) ( 3 {-z|) (5.100) 

Since Ml 2 + |h| 2 = 1, we see that 

P {i) ~ 2 (l+ z ).3) (.3(+ z l) + ~ (I z ) 3 ) (s( z l) (5.101) 

which is equivalent to a completely unpolarized state, showing that Bob has no 
information about the state of the particle Alice is attempting to teleport. On the 
other hand, as we saw in Example 5.3, if Bob waits until he receives the result of 
Alice’s Bell-state measurement, he can then maneuver his particle into the state 1 1 jr) 
that Alice’s particle was in initially. 

5.8 Summary 


In this chapter we have examined the quantum states of two particles. We use a 
number of different notations to specify a two-particle state. A general state | xfr) can 
be expressed in terms of the single-particle states by 

I^ = E c O-K>i®M 3 >2 (5.102) 

41 

where in this form we are not presuming that the states \a,) for particle 1 are 
necessarily the same as the states \bj) for particle 2. The symbol <g> indicates a special 
product, called a direct product, of the kets from the two different vector spaces. We 
often dispense with the direct-product symbol and simply write 

1 1r) = Y i c l j\a i ) l \bj) 2 (5.103) 

u 

or 

\f) = E MjMo bj) = E Mo. bj)(a h bj\f) (5.104) 

ij ij 
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where we understand the ket \a h bj) to be a two-particle ket with particle 1 in the 
state | a t ) and particle 2 in the state | bj). 

Because our discussion of quantum mechanics so far has emphasized the spin 
degrees of freedom of a spin-4 particle, we have focused our attention in this chapter 
on the spin states of a system of two spin-1 particles. In particular, we have discovered 
that the eigenstates of total spin angular momentum S = Sj -f S 2 are given by 


|1. 1) = l+Z, +Z> 

(5.105a) 

|1» o) = -L+z, —z) + -L|-z, +z> 

V2 -/l 

(5.105b) 

I, -1) = l“Z, -z) 

(5.105c) 

|0, 0) = | —f-z, -z)-j=|-z, +z) 

y/2 V2 

(5.106) 


where the labels for the kets on the left-hand side are just the total angular momentum 
states 


S 2 |^, m) = .vf.v + l)fi 2 |.v, m) (5.107a) 

S z |s, m) — mh\s, m) (5.107b) 

with S the total spin. As for any angular momentum, we can use one of the compo¬ 
nents and the square of the magnitude of the total spin to label the total-spin states. 
The states in (5.105) form a triplet of total-spin-1 states, since s = 1 for these three 
states (m = 1, 0, —I), while the state (5.106) is a singlet s = 0 state. Notice in par¬ 
ticular that the spin-1 states are unchanged (symmetric) if you exchange the spins 
of each of the particles, while the spin-0 state changes sign (antisymmetric) under 
exchange of the spins of the two particles. These spin states will play a pivotal role 
in our discussion of the possible states of identical particles in Chapter 12. 

Although states such as (5.105) and (5.106) are natural extensions of our speci¬ 
fication of a quantum state for a single particle to that for two particles, experiments 
carried out on states such as these tell us much about the physical nature of reality. 
The total-spin-0 state has been our laboratory for an investigation of the correlations 
that exist between the measurements of the individual spins of the two particles in 
such a state. These correlations have been shown to be in experimental disagreement 
with those that would occur if the particles were each to possess definite attributes. 
In fact, in measurements on a two-particle spin state, measurements on one of the 
particles can fix the result of a measurement of the other particle, even though the 
particles may be separated by arbitrarily large distances and neither of the parti¬ 
cles possess a definite attribute before the measurement. Although this may seen 
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paradoxical, it is a natural outcome of applying quantum mechanics to an entangled 
two-particle system. 

Entanglement has a precise definition in quantum mechanics. We say a two- 
particle state consisting of particles 1 and 2 is entangled if it cannot be factored 
into a direct product of single particle states, that is, it cannot be written in the form 
|a)l ® | b) 2 = \a)\\b) 2 = \a , b). Thus the total-spin-0 state (5.106), the centerpiece 
of our discussion of the EPR paradox, is a classic example of an entangled state. We 
have seen in Section 5.6 how entanglement can be utilized to teleport the quantum 
state of a particle. 

In comparing experimental results with the predictions of quantum mechanics, 
it is often the case that the experiment will generate an ensemble of particles in a 
statistical mixture of pure states. Such a mixture is referred to as a mixed state. For 
a mixed state, one for which p k is the probability that a particle is in the pure state 
[ithe density operator p is given by 

P = ]T>t \f (k) )dr m \ (5.108) 

k 

The trace of the density operator is the sum of the diagonal matrix elements: 

*P = E E Pk{i\f m )(f ik) \i) = E Pk = 1 (5-109) 

i k k 

The expectation value for an observable A is given by 

(A) = tr(Ap) (5.110) 


Problems 


5.1. Take the spin Hamiltonian for the hydrogen atom in an external magnetic field 

B 0 in the z direction to be ), 

H — —-Sq • S 2 + m 0 S ]z 
n- 

where a> {) = geB (j /2mc, with m the mass of the electron. The contribution - ft 2 ■ B 0 
of the proton has been neglected because the mass of the proton is roughly 2000 times 
larger than the mass of the electron. Determine the energies of this system. Examine 
your results in the limiting cases A » hto i} and A <§; ha> 0 by expanding the energy 
eigenvalues in a Taylor series or binomial expansion through first nonvanishing order. 

5.2. Express the total-spin s — 1 states of two spin-j particles given in (5.30a) and 
(5.30c) in terms of the states |+x, +x), |+x, —x), |-x, +x), and |—x, —x). 
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5.3. Express the total-spin s — 0 state of two spin-/ particles given in (5.31) in terms 
of the states |+n, -n) and |— n, +n), where for a single spin-1 particle 


0 

|-Hi) = cos -|+z) + e l<t> 
|—n) = sin -j+z) — e'^ 


sin - — z) 
2 


cos - —z) 
2 


5.4. In an EPR-type experiment, two spin-| particles are emitted in the state 

IE 1} = |+*T +*) 


A and B have their SG devices oriented along the x axis. Determine the probabilities 
that the resulting measurements tind the two particles in the states |1, l) t . j 1, ()} t , 
and If, -1}*. 


5.5. At time t = 0, an electron and a positron are formed in a state with total spin 
angular momentum equal to zero, perhaps from the decay of a spinless particle. The 
particles are situated in a uniform magnetic field B 0 in the z direction. 

(a) If interaction between the electron and the positron may be neglected, show 
that the spin Hamiltonian of the system may be written as 

H = c%(S lz — S 2z ) 

where S, is the spin operator of the electron, S 2 is the spin operator of the 
positron, and o> {) is a constant. 

(b) What is the spin state of the system at time ;? Show that the state of the 
system oscillates between a spin-0 and a spin-1 state. Determine the period 
of oscillation. 

(c) At time /, measurements are made of .S',, and S 2x . Calculate the probability 
that both of these measurements yield the value ft/2. 

5.6. Take the spin Hamiltonian of thepositronium atom (abound state of an electron 
and a positron) in an external magnetic field in the z direction to be 

2A ~ 

H = H + m 0 (S lz - %) 

Determine the energy eigenvalues. 

5.7. Determine the four states with s — 4 that can be formed by three spin-j par¬ 
ticles. Suggestion: Start with the state ||, |) and apply the lowering operator as in 
(5.36). 


5.8. Measurements of the spin components along two arbitrary directions a and b 
are performed on two spin-j particles in the singlet state |0, 0). The results of each 
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measurement are denoted by ±1, depending on whether the measurement finds the 
particle spin-up or spin-down along that particular axis. Denoting by P ±± (a, b) 
the probabilities of obtaining ±1 along a for particle 1 and ±1 along b for particle 2, 
the average value for the product of spins is given by 

E( a, b) = P ++ (a, b) + P _ (a, b) - P + _( a, b) - P_ + (a, b) 


Show that 


E( a, b) = - cos 6 ah 

where 9 ab is the angle between a and b. If a and b are unit vectors, then we can write 
£(a, b) = — a ■ b. 


5.9. Show that 

(0, 0\S la S 2h \0, 0) = — cos 9 ah 
4 

where 9 ab is the angle between a and b. Therefore 

£(a, b) = i{0.0|S k S 2 ,|0, 0} 
where E{ a, b) is defined in Problem 5.8. 


5.10. As noted in Section 5.5, tests of the Bell inequalities are typically carried out on 
entangled states of photons. We can label the linear polarization states of the photons 
in a manner analogous to the way we labeled the spin-up and spin-down states for a 
spin-j particle in Problem 5.8. Namely, we assign the value +1 if a measurement by 
Alice (Bob) finds the photon to be horizontally polarized along an axis labeled by the 
unit vector a (b) and a value — l if the measurement finds the photon to be vertically 
polarized, that is polarized along an axis perpendicular to a (b). Asjn Problem 5.8, 
we then define M 


E( a, b) = P ++ ( a, b) + P__(a. b) - P + _(a. b) - P_ + (a, b) 


Show for the two-photon entangled states 


\f) = 


1 

v/2 


\x,x) 


± 


1 

V2 


I y,y) 


that 


E( a, b) = cos 29 ab 

Compare with the data in Fig. 5.9. Note the different dependence of £(a, b) on °ab 
for photons in comparison with that for spin-^ particles in Problem 5.8. 
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a 



Figure 5,12 Orientation of the analyzers for Prob¬ 
lem 5.12, 


5.11. Repeat the calculation of 


E( a, b) = P ++ ( a, b) + P _ (a, b) - P + U a, b) - P_ + (a. b) 

in Problem 5.10 for the two-photon entangled states 

W) = -U,y> ± -7=|y, 

\/2 V2 

5.12. Using the correlation function E( a, b) from Problem 5.10, define 
5 = E( a, b) - E( a, b') + £(a\ b) + £(a\ b') 


where 5 involves four measurements in four different orientations of the polariz¬ 
ers. Evaluate S for the set of orientations in which Q ab = 0 ba > = 6 a , b , = 22.5° and 
0 ab ' = 61.5°, as shown in Fig. 5.12. It can be shown that -2 < S < 2 in a local 
hidden-variable theory. The experiment of Aspect et al. discussed in Section 5.5 
yields S expt = 2.697 ± 0.015. This particular choice of angles yields the greatest 
conflict between a quantum mechanical calculation of S and the Bell’s inequality 
-2<S<2. 


5.13. The annihilation of positronium in its ground state produces two photons that 
travel back to back in the positronium rest frame along an axis taken to be the z axis. 
The polarization state of the two-photon system is given by 

\if) = ~\R, R) - \=\L, L) 

V2 s/2 

(a) What is the probability that a measurement of the circular polarization state 
of the two photons will find them both right-handed? Both left-handed' 7 

(b) What is the probability that photon 1 will be found to be x polarized and 
photon 2 will be found to be y polarized, that is, that the system is in the state 
\x, y)2 What is the probability that the system is in the state [y, x)? 
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(c) Compare the probability for the two photons to be in the state \X, x) or in the 
state |y, y) with what you would obtain if the two-photon state were either 
\ R, R) or |Z_, L) rather than the superposition |t/r). 

Note: Since the photons are traveling back to back along the z axis, if photon 1 
is traveling in the positive z direction, then photon 2 is traveling in the negative z 
direction. Consequently, 

|/?)t =-^=|x) 2 —-^=|y}2 and \L) 2 = ~^\ x )i + -y=|y>2 

5.14. The first part of this problem follows the procedure outlined in Problem 3.5. 
The operator 


RWi) - c" iS ^ h 


rotates spin states by an angle 8 counterclockwise about the x axis. 

(a) Show that this rotation operator can be expressed in the form 

0 2i * 9 

R(8 1 ) = cos-5, sin - 

2 h 2 

Suggestion: Use the states |+z) and |— z) as a basis. Express the operator 
R(8 i) in matrix form by expanding R in a Taylor series. Examine the explicit 
form for the matrices representing S~, ir’, and so on. 

(b) Use matrix mechanics to determine which of the results T||’> or |d>p ! ) of 
Alice’s Bell-state measurement yields a state for Bob’s particle that is rotated 
into the state [i j/) by the operator R(ni). 


5.15. Determine which of the results of Alice’s Bell-state measurement yields a state 
for Bob’s particle that is rotated into the state |r/r> by the operator Rink). If you have 
worked out Problem 5.14, you can simply verify that the remaining state for Bob’s 
particle is indeed rotated into the state |i/r) by a 180° rotation about the z axis. 


5.16. A spin-^ particle is in the pure state |t/r) = a +x) + b\— z). 

(a) Construct the density matrix in the S z basis for this state. 

(b) Starting with your result in (a), determine the density matrix in the S x basis, 
where 


|+X) = —r|+Z) 
v2 


V2 




(c) Use your result for the density matrix in (b) to determine the probability that 
a measurement of S x yields ft/2 for the state yV). 
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5.17. Given the density operator 

P = ~ (l+z)<+zj + 1—z}<—z| - |-z)<+z| - |+z)<-z|) 

construct the density matrix. Use the density operator formalism to calculate (S x ) 
for this state. Is this the density operator for a pure state? Justify your answer in two 
different ways. 

5.18. Given the density operator 

P = 7l+z)<+z| + 41—z) <—z| 

4 4 

construct the density matrix. Show that this is the density operator for a mixed state. 
Determine { S x ), (S v ), and {S z ) for this state. 

5.19. Show that 

P = ^|+n)(+n| + ^|—n){—n| = ^|+z)<+z| + ^|-z}(-z| 

where 


0 ; 

* . 9 

|+n) = cos -|+z) + e H 

* sin - —z 
2 

0 

. 9, , 

1— n} = sin - +z) — e l<> 

cos - — z 

2 

2 


5.20. Find states |i/r,) and 1 1 // 2 ) for which the density operator in Problem 5.18 can 
be expressed in the form 

p = ^I^KV'tI + 

5.21. The density matrix for an ensemble of spin-j particles in the S, basis is 

f — *( 4 n ) 

Sr basis \ n* p / 

(a) What value must p have? Why? 

(b) What value(s) must n have for the density matrix to represent a pure state? 

(c) What pure state is represented when n takes its maximum possible real value? 
Express your answer in terms of the state |+n) given in Problem 5.19. 

5.22. Show that the Curie constant for an ensemble of N spin-1 particles of mass 
m and charge q = —e immersed in a uniform magnetic field B = Sk is given by 

2 Njj? 

3k B 
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where fx = geh/lmc. Compare this value for C with that for an ensemble of spin-4 
particles, as determined in Example 5.6. 


5.23. An attempt to perform a Bell-state measurement on two photons produces a 
mixed state, one in which the two photons are in the entangled state 


A=\ x > x ) + -4i y* y) 

s/2 V2 


with probability p and with probability (1 - p)/2 in each of the states (x, x) and 
| y, y). Determine the density matrix for this ensemble using the linear polarization 
states of the photons as basis states. 


5.24. Show that the Bell basis state 


|<i> (+) ) = -pl+ z ’ + z ) + "~7=I~ Z ’ ~z) 
V2 sjl 


in the S f basis is given by 


Id> (+) } = —~|+x, +x) + ~ | X, -x> 


V2 


V2' 


Also show that 


<£(->) _ |+ z , +z)-7=I~ Z - _z ) 

s/'l s/2 

1 , I , 

- • r -| 1 X. -x) + — | —X, +x) 

s/2 s/2 


5.25. Show for the density operator for a mixed state 


p = Yh Pfc W {k) )iV k) \ 

k 


that the probability of obtaining the state \<f>) as a result of a measurement is given 
by tr(P j(W yo), where P ](j>) = \4>){<p\. 


5.26. Use the density operator formalism to show that the probability that a mea¬ 
surement finds two spin-! particles in the state |+x, +x) differs for the pure Bell 
state 


d> ( U\ = 


1 , v 1 , 
—=|+z, +'i) H -=| 

V2 s/2 


-z, -z) 


for which 


p = |cp (+) ){ct) (+ >| 
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and for the mixed state 

p = ^|+z, +z)(+z, +z| + ^[-z, z) { z, z| 

Thus, the disagreement between the predictions of quantum mechanics for the 
entangled state |<t> (+) ) and those consistent with the views of a local realist are 
apparent without having to resort to Bell inequalities. 

5.27. Show that tr(AB)= tr(BA). 

5.28. Show that the equation governing time evolution of the density operator for a 
mixed state is given by 

ih^-p(t) = [H, p(t )] 
dt 

5.29. 

(a) Show that the time evolution of the density operator is given by 

pit) = u(t)p(0)uHt) 
where U(t) is the time-evolution operator, namely 

uum(O)) = m)) 

(b) Suppose that an ensemble of particles is in a pure state at t = 0. Show 
the ensemble cannot evolve into a mixed state as long as time evolution is 
governed by the Schrodinger equation. 
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CHAPTER 6 

Wave Mechanics in One Dimension 


Thus far, our discussion of quantum mechanics has concentrated on two-state sys¬ 
tems, with most of the emphasis on the spin states of a spin-| particle. But particles 
have more degrees of freedom than just intrinsic spin. We will now begin to discuss 
the results of measuring a particle's position or momentum. In this chapter we will 
concentrate on one dimension and neglect the spin degrees of freedom. This marks 
the beginning of our discussion of w'ave mechanics. 


6.1 Position Eigenstates and the Wave Function 


When we want to analyze the results of measuring the intrinsic spin S, of a spin- j 
particle, we express the state j iff } of the particle as a superposition of the eigenstates 
|+z) and | —z) of the operator S z . If we are interested in measuring the position x of 
the particle, it is natural to introduce position states |x) satisfying 

x\x) x\x) (6.1) 

where x is the position operator and the value of x runs over all possible values of 
the position of the particle, that is, from — oo to +oo. 

Strictly speaking, such position eigenstates are a mathematical abstraction. In 
contrast to the measurement of the intrinsic spin S z of a spin-j particle, where we 
always obtain either h/2 or - h/2. we cannot obtain a single value for the position of 
a particle when we try to measure it. As an example, Fig. 6.1 shows a schematic of a 
microscope that might be used to determine the position of a particle. Light scattered 
by the particle is focused by the lens on a screen. The resolution of the microscope— 
the precision with which the position of the particle can be determined—is given by 

Ax ~ — (6.2) 

sin 4> 

191 
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Figure 6.1 A microscope for determining the 
position of a particle. 


where X is the wavelength of the light and the angle (p is shown in the figure. The 
physical cause of this inherent uncertainty in the position is the diffraction pattern 
that is formed on the screen when light passes through the lens. We can make the 
resolution sharper and sharper by using light of shorter and shorter wavelength, but 
no matter how high the energy of the photons that are used in the microscope, we 
will never do better than localizing the position of the particle to some range Ax in 
position. Thus, as this example suggests, we cannot prepare a particle in a state with 
a definite position by making a position measurement. In fact, as our discussion of 
the Hi nstei n-Podol sky Rosen paradox emphasized, we should not try to view the 
particle as having a definite position at all. 

Although it is not possible to obtain a single value for the measurement of the 
position of the particle, nonetheless kets such as \x) in which the particle has a 
single position are very useful. We may think of the physical states that occur in 
nature as a superposition of these position eigenstates. How should we express this 
superposition? If we try to mimic the formalism that we used for intrinsic spin with 
its discrete eigenstates and write 

\f) = ^2 I x i)( x M) 

i 

where the sum runs over all the values of the position, we can quickly see that we 
have a serious problem. Writing the bra (ip as 
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allows us to calculate 


j 


= (flXjHxj\Xi)$Xj\ilr) 

u 

Again, if we mimic our earlier formalism and use [x } |x ( - > = which states that the 
probability amplitude is equal to unity if the positions are the same and equal to zero 
if they are different, we obtain 

w w) = Y! w*i>(*/i = Y2 i^i , A>i 2 

i i 

Let’s consider a point x t =a where is not zero. Then we may choose a 

sufficiently nearby point x ( = a + Ax where {a + Ax \ x{r) is also not zero. However, 
there are infinitely many points between a and a + Ax, no matter how small we 
choose Ax. Thus 


X ] K ^;!^)! 2 = °° 


and we are unable to satisfy the condition ( i//h//) = I. 

Our way of expressing | \fr} as a discrete sum of position states cannot work when 
we are dealing with a variable like position that takes on a continuum of values. 
Instead of a sum, we need an integral: 


W) = 



dx \x) (a*|t/t} 


(6.3) 


Now the coefficient of the position ket |x) is dx (x \ ) so that if we integrate only 
from a to a + Ax, we obtain a contribution to \fi) of 


a+Ax 

dx \x){x\\js) 

which vanishes for vanishingly small Ax. Examination of (6.3) shows that the 
generalization of the completeness relation (2.48) to kets like those of position that 
take on a continuum rather than a discrete set of values is 



dx |x)(x| = 1 


(6.4) 


In order to see how these position kets should be normalized, let’s consider the 
special case where the ket |i (r) in (6.3) is itself a position state |x'). Then 


/ OO 

dx |x)(x|x / ) 

-OO 


(6.5) 
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which implies that 


{x\x') = <5(x — x'y 


( 6 . 6 ) 


where S(x — x) is a Dirac delta function . 1 It is comforting to note that when 
x 7 ^ x', the amplitude to hnd a position state |x') at position x vanishes. It is, 
however, somewhat disquieting to see that when x = x', the amplitude is infinite. 
Let’s recall that we are not able to make measurements of a single position out of 
the continuum of possible positions. Thus an amplitude like {x\x’) is not directly 
related to a physically observable quantity. Dirac delta functions always appear 
within an integral whenever we are calculating anything physical. For example, the 
bra corresponding to the ket (6.3) is 

/ OO 

dx (f \x)(x\ (6.7) 

-OO 


We now calculate 


= J dx (f\x)(x\f) = J dx\{x\f)\ 2 


( 6 . 8 ) 


where we have used a different dummy integration variable for the bra equation 
(6.7) than for the corresponding ket equation (6.3) because there are two separate 
integrals to be carried out when evaluating (t/r|t/r). Note that we could also have 
obtained this result by inserting the identity operator (6.4) between the bra and the 
ket in {ifr\\fr). Unless stated otherwise, the integrals are presumed to run over all 
space. The requirement that (?// j t//} = 1 becomes 


1 = j dx {\fr\x}{x\ifr) = J dx\{x\ir}\ 2 
It is natural to identify 


(6.9) 


dx\(x\f}\ 2 ( 6 . 10 ) 

with the probability of finding the particle between x and x + dx if a measurement of 
position is carried out, as first suggested by M. Bom. The requirement that (^r\\jr) = 1 
then ensures that the total probability of finding the particle in position space is unity. 
The complex number (x \ i//) is the amplitude to find a particle in the state |yV) at 


1 Dirac delta functions are discussed in Appendix C. 


Page 210 (metric system) 


6.2 The Translation Operator | 195 


position x. This amplitude will, in general, have different values for each different 
value of x; namely, it is a function, which we call the wave function f(x) of wave 
mechanics: 


(x\f) = f(x) 

In terms of t/r(x), (6.9) may be written 


/ 


1=/ dx (x)^lf {x) = / dx \is(x)\ 


/ 


( 6 . 11 ) 


( 6 . 12 ) 


Finally, as we saw in Chapter 2, for an observable A the expectation value is 
given by 


{A} = (f\A\l/,) 

Thus the average position of the particle is given by 

{x) = {f\x\rjf) = j dx {f\x\x)(x\f) 

=J<ur(ww=fd* if mi 2 * 


(6.13) 


(6.14) 


which is the “sum” over all positions x times the probability of finding the particle 
between x and x +- dx. 


EXAMPLE 6.1 Presuming we know the wave functions {x\(p) and {x|i/r}, 
evaluate the amplitude (<p\ifr). 

SOLUTION We evaluate the amplitude ((p\i/) by inserting the identity 
operator (6.4) between the bra vector and the ket vector: 


iv\f) 


/ 


dx {<p\x)(x\\jj) = I dx (p*{x)\p (a) 


/ 


6.2 The Translation Operator 


The natural operation to perform on our one-dimensional position basis states is to 
translate them: 


f (a)\x) = |x + a) 


(6.15) 
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Figure 6.2 (a) Areal wave function \(r(x) = (x|r/r) and (b) the wave 
function of the translated state f(x) = (x\i(r') = ifr(x — a). 


The operator T(a) changes a state of a particle in which the particle has position x 
to one in which the particle has position x + a. In order to determine the action of 
the translation operator on an arbitrary state 1 i//), 

W) = T(aM) (6.16) 

we express \xj/) as a superposition of position states. Then 

f (a)\rj/) = T(a) j dx'\x f ){x'\^r) = J dx' \x' + a){x'\^) (6.17) 

We have called the dummy variable in the integral x' because we want to calculate 
the amplitude to find this translated state at the position x: 

if'(x) = {x\f') = (x\T(a)\f) 

= J dx' {x\x' + a){x'\yjf) = J dx' 5[x - (x + a)]{x'\t//} 

= {x - a\xj/) — \jr(x - a) (6.18) 

At first it might seem strange that f' (x) ^ f(x + a). But if, for example, xjr{x) has 
its maximum at x — b, then ’f'(x) = i js(x — a) has its maximum at x — a — b, or 
x = a + b, as shown in Fig. 6.2. The state has indeed been translated in the positive 
x direction. 

Notice that the translation operator must be a unitary operator, since translating 
a state should not affect its normalization, that is, 

(f'\t') - mt\a)T(aM) = (*\ir) (6.19) 

which requires 

f f (a)f(a) — 1 (6.20) 

We can take advantage of this result to derive (6.18) in another way. From (6.15) 
we know that 

(x|7’ f (a) = {x + a\ (6.21) 
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But since the translation operator is unitary, 7 f is the inverse of T. Thus if 7’ i 
translates a bra vector by a, then T translates it by —a: 

(x\f(a) = (x-a\ (6.22) 

Therefore 

{x\f(a)\i/r) = (x — a\i!/) = t/r(x — a) (6.23) 


as before. 

This result is reminiscent of our discussion in Section 2.5 on active versus passive 
rotations. Here we have introduced the translation operator T(a) as an operator 
that translates the ket | rjr) by a in the positive x direction, creating a new ket |i/r'). 
When we examine the wave function of this translated state, we see that we can 
also consider this wave function as the amplitude for the original ket |i//} to be 
in position states that have been translated by a in the negative x direction. Thus 
an active translation on the state itself is equivalent to a passive translation in the 
opposite direction on the basis states that are used to construct the wave function. 2 

6.3 The Generator of Translations 


We next consider the infinitesimal translation operator 

f (dx) = 1 — — p x dx (6.24) 

h 

whose action on a position ket is given by 

f (dx)\x) = \x + dx) (6.25) 

to first order in the infinitesimal dx. The operator p x is called the generator of 

translations. We can generate a finite translation by a through the application of 
an infinite number of infinitesimal translations: 

N 

= (6.26) 

Recall that the infinitesimal rotation operator is given by R(dipk) — 1 — iJ z d<p/h, 
where the generator of rotations J z has the dimensions of angular momentum and is 
the operator for the z component of the angular momentum. Also, the infinitesimal 


t ( a) = lim 

N-+ oo 


i , a 
h Px 


2 In some texts it is not uncommon to define the translation operator as one that shifts the 
argument of the wave function in the positive x direction by a. In those texts, the translation 
operator will be the inverse of the one we have introduced here and will actually translate the 
physical state in the negative x direction. 
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time-translation operator is given by U{dt) = 1 — iHdtJh , where the generator H 
has the dimensions of energy and is the energy operator. Since the dimensions of the 
generator of translations p x are those of linear momentum, you will probably not be 
surprised to discover that it is indeed the operator for the x component of the linear 
momentum. 

In order to justify this assertion, we need to examine two important properties 
of the generator of translations. First, note that unitarity of the translation operator 
requires that the generator of translations must satisfy 

Px = fit (6.27) 

that is, it is a Hermitian operator. Second, we can establish that the generator of 
translations does not commute with the position operator. Consider an infinitesimal 
translation by Sx? Then 


x T(8x) - f(8x)x =x [l - jp x 8x^ - ^1 - jp x 8x^j x 

j (xfix ~ fix*) 

= (—/h I (6.28) 

This is a relationship between operators and therefore means that for any state \ f) 

(x f(8x) - T(8x ) i)| f) = ( 

If we use the expansion (6.3) for \yjr) in the position basis to evaluate the left-hand 
side of (6.29), we obtain 


[x, p x ]\f) 


(6.29) 



(x f(8x) — f(8x) x) j dx |x) (x |i/f) 

— X J dx \x + 8x)(x\ir) - f(8x) j dx x\x){x\ifr) 

— j dx {x + 8x)\x + 8x)(x\f) - j dx x\x + 8x){x\f) 

— & x j dx \x + 8x){x\xlf) = 8x j dx \x)(x\x{r) = 8x \xfr) 


(6.30) 


1 We call the inftnitesitnal translation 8x instead of dx to avoid confusion with the integration 
variable in (6.30). 
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where in the next to last step we have kept just the leading order in the infinites¬ 
imal <$jc. 4 Comparing (6.29) and (6.30), we see that the position operator and the 
generator of translations obey the commutation relation 


[x,p x \ = ih (6.31) 

Given the pivotal role that the commutation relations (3.14) played in our discussion 
of angular momentum, it is probably not surprising to find that the commutation 
relation (6.31) plays a very important role in our discussion of wave mechanics. 

In order to ascertain the physical significance of the generator of translations, we 
next examine the time evolution of a particle of mass m moving in one dimension. 
Continuing to neglect the spin degrees of freedom of the particle, 5 we can write the 
Hamiltonian as 

^2 

H = i£. + V(x) (6.32) 

2m 

where we have expressed the kinetic e nergy of the particle in terms of the momentum 
and added a potential energy term V. Note that we are denoting the momentum 
operator by the same symbol as the generator of translations. We will now show 
that for quantum mechanics to yield predictions about the time evolution that are in 
accord with classical physics when appropriate, it is necessary that the momentum 
operator satisfy the commutation relation (6.31). 

Using (4.16), we can calculate the time rate of change of the expectation value 
of the position of the particle: 

-- = l ~mH, x]\f) - J—wip xw 

dt h 2m fi 

= r^— (f\(P x lP**x]+ lP X ’*]PxM) 

2m n 

= 2!M!*<£ S ) (6.33) 

m m 

Moreover, you may also check (see Problem 6.1) that 


d ^EA = Uf\[H, p.M) 

dt fi 

_l_dV_ 

\ dx 


(6.34) 


4 If this step bothers you. see also the discussion in going from (6.38) to (6.39). Here too, we 
can shift the integration variable (*' = x 4- 3x), expand the wave function in a Taylor series, and 
retain only the leading-order term. 

5 It’s hard to worry much about angular momentum in a one-dimensional world. 
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In deriving these results, we have had to assume that the commutation relation (6.31) 
is satisfied. Thus, a necessary condition for the expectation values of position and 
momentum to obey the laws of classical physics is that the momentum operator 
and the generator of translations satisfy the same commutation relation with the 
position operator. Although it may seem somewhat abstract, the best way to define 
the momentum operator is as the generator of translations, just as we defined the 
angular momentum operators as the generators of rotations and the Hamiltonian, 
or energy operator, as the generator of time translations. Note that the momentum 
will be a constant of the motion when the Hamiltonian and the momentum operator 
commute. But since p x is the generator of translations, this means the Hamiltonian is 
translationally invariant, which is the case when V (x ) is independent of x [in accord 
with (6.34)]. 

A word of caution about (6.33) and (6.34), which are often referred to as Ehren- 
fest’s theorem, is in order. These equations do not mean that the motion of all particles 
is essentially classical. If, as in classical physics, we call — dV jdx = F(x), then by 
expanding the force F(x) in a Taylor series about x — (x), we obtain 


F(x) = F« jc» + (x - (x)) 


dF 
dx J T _ 


+ 


W 


(x - (x)) 2 j (FF 
dx 2 


+ 


(6.35) 


x=(x) 


and therefore 


= (F(x)) = F«x» + +■■■ (6.36) 

dt 2 \dx l J x={x) 

The first term on the right-hand side of (6.36) shows the expectation values obeying 
Newton’s second law, while the other terms constitute corrections. When are these 
corrections negligible? For example, we may certainly neglect the second term in 
comparison with the first if the uncertainty Ax is microscopic in scale and the force 
varies appreciably only over macroscopic distances. In fact, this is true whether the 
particle itself is macroscopic or microscopic, and it accounts for our being able to 
use classical physics to analyze the motion of the particles in the Stem-Gerlach 
experiments of Chapter 1. 

An immediate consequence of the commutation relation (6.31) that follows from 
the uncertainty relation (3.74) is 


ft 

AxA Px > - (6.37) 

the famous Heisenberg uncertainty principle. We will return to discuss this important 
relation in Section 6.7, but first we need to venture briefly into momentum space. 
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6.4 The Momentum Operator in the Position Basis 


We begin by using the action of the infinitesimal translation operator on an arbitrary 
state \xj/) to determine the representation of the momentum operator in position 
space. As before, we first express \x(f) as a superposition of position kets It), since 
we know the action of the translation operator on each of these kets: 

f(8x)\\fr}=f(8x) J dx\x)(x\f) 

= j dx' \x')(x' — 8x\\ls) (6.38) 

where in the last step we have made the change of integration variable x + Sx = x’. 
Expanding (x f — Sx\x[r) = t fr(x' — 5x) in a Taylor series about x' to first order, we 
obtain 

\jr(x' — (5x) = \j/(x') — <5jc-— \ff(x') = (x'\ip) — Sx — {x'\i/) (6.39) 

dx' dx' 

Substituting this result into (6.38), we have 

f(Sx)\f} = J dx’ \x') (^{x'\ir) - Sx-~- f {x'\f)^ 

= \t) - Sx j dx’ |x')^;{x , | f) 

= (l - j' Pjx ) (6.40) 


where the last step follows from the explicit form (6.24) of the infinitesimal transla¬ 
tion operator. Therefore 


Px\f) - T f dx> \x')-~-(x’\f) 
i J dx' 


(6.41) 


If we take the inner product with the bra (x\, we obtain a very useful result: 


(x\p x IV') = t f dx '(x\x')^-{x'\ir) 
l J 0X f 

-7 1 . 3 


dx' S(x — x')—— 
dx' 


h d , UK 
= T — ( x \f) 
7 OX 


(6.42) 
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If we choose the state |i fi) = |x'), we obtain the matrix elements of the momentum 
operator in the position basis: 6 

{x\p x \x') = -^-(x\x f ) = - x') (6.43) 

i d.v i dx 

We can also obtain a standard result from wave mechanics by taking the inner product 
of (6.41) with the bra (xfr |: 


(Px) = (f\p x \f) = J dx ’ 


dx' 


j dx' f*{x')-~ f(x')= f dx f*{x)- — f{x) (6.44) 
J 1 dx' J 


h d 
i dx 


The results (6.42), (6.43), and (6.44) all suggest that in position space the momentum 
operator takes the form 


Px -► 

x basis 


h d 
i dx 


(6.45) 


6.5 Momentum Space 


Having introduced the momentum operator as the generator of translations, we now 
consider a new set of states, momentum eigenstates, satisfying 

p x \p) = p\p) (6.46) 

Momentum, like position, is a continuous variable. We can express an arbitrary state 

\ir) as a superposition of momentum states: 

\f) ~ J d P \P)(P\^) (6.47) 

where the integral again runs from — oo to +oo, but here we are integrating over 
momentum instead of position. Since 

(p'\p) =8(p’ - p) (6.48) 

[see the discussion surrounding (6,5)], then 

\ = {f\f) = j dp\(p\f)\ 2 (6.49) 


h It is a somewhat unusual matrix since the row and column vectors have a continuous label. 
To make sense of the derivative of a Dirac delta function, see (C.12). 
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and we can identify 


dp \{p\^)\ 2 


(6.50) 


as the probability that a particle in the state |i \jr) has its momentum between p and 
p + dp if a measurement of momentum is carried out. Just as we refer to {x\\J/) as 
the wave function in position space, we call {p\ijf) the wave function in momentum 
space. 

We can now determine {x\p). the momentum eigenstate’s position-space wave 
function. Take the ket |i/r) in (6.42) to be a momentum eigenket \p). We obtain 

H d 

( x\p x \p) = p(x\p) = r — &\ P) (6-51) 

/ ox 

where we have taken advantage of the momentum eigenket relation (6.46) in the mid¬ 
dle step. The differential equation (6.51) is easily solved to yield (x\p) = Ne ,px,h . 
We can determine the constant N up to an overall phase by requiring the momentum 
state to be properly normalized. First we express the state | p) in terms of the position 
basis as 


Then 



dx |.v) ( v | p) 


(6.52) 


(P'\P) = j dx (p'\x)(x\p) 

/ CO 

dx 

-oo 

= N*N(2nh)8(p - p') (6.53) 

where we have used the representation (C.17) of the Dirac delta function. Thus we 
choose N — 1/ y/2nh so that the normalized momentum eigenfunction is given by 

(x\p) = —L=e ipx/n (6.54) 

s/2 nh 


The Euler identity 


gipx/h _ C0S (p X /),) _)_ j sin (px / ft) (6.55) 

emphasizes the complex character of this oscillatory function. Note that when x 
changes by a wavelength X, the phase changes by 2n. Consequently, pX/h = 2n or, 
more simply, 

A — — (6.56) 

P 
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which is the famous de Broglie relation. Thus in position space the momentum 
eigenfunction (6.54) is a (complex) wave extending over all space with a particular 
wavelength (6.56), while in momentum space, given the normalization condition 
(6.48), it is an infinitely sharp and infinitesimally thin spike. 

Although we have called (6.54) the momentum-state wave function in position 
space (or the momentum eigenfunction), it is important to realize that the amplitudes 
(x\p) provide us with the necessary ingredients to transform back and forth between 
the position and the momentum bases, just as the amplitudes such as (+z|+x) 
permitted us to go back and forth between the S z and the S x bases in Chapter 2. Here, 
instead of a 2 x 2 matrix and matrix multiplication, we have integrals to evaluate: 


(p\f) = j 

f dx(p\x)(x\x(r) = j 

f dx J_ e- ipx ' h {x\f) 
sjlrch 

(6.57a) 

Mf) = j 

f dp (x\p){p\ir) = 

J 

f dp — L= e^'Hpm 

' V2 nh 

(6.57b) 

where both in position and momentum space the integrals run from - 

-oo to +oo. 


These equations show that {p\\ff) and {x\\j/) form a Fourier transform pair. 


6.6 A Gaussian Wave Packet 


The form of the position-space momentum eigenfunction (6.54) gives us another 
way to see why a single momentum state is not a physically allowed state. Such a 
state clearly has a definite momentum p and therefore A p = 0, or zero momentum 
uncertainty. The probability of finding the particle between x and x + dx, 

\{x\p)\ 2 dx = (6.58) 

2nn 

is independent of x. Thus the particle has a completely indefinite position: Ax = oo. 
There is no physical measurement that we can carry out that can put a particle in 
such a state. How then do we generate physically acceptable states, even for a free 
particle? Since the particle is not permitted to have a definite momentum, (6.57b) 
suggests that we should superpose momentum states to obtain a physically allowed 
state | iff) satisfying (i//|i//) = 1. Such a superposition is called a wave packet because 
in position space the momentum eigenfunctions (6.54) are oscillating functions 
characteristic of waves. We are familiar with such superpositions for classical waves 
like sound waves; a clap of thunder is a localized disturbance that audibly contains 
many different frequencies or wavelengths. 

We start with the Gaussian wave packet 

(x\x(r) = = Ne~ xl/2a2 (6.59) 
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which is a mathematically convenient lump in position space to play with. The nor¬ 
malization constant N [a different N than in (6.53)] is determined by the requirement 
that 


J dx f*(x)f{x) = N*N J 


dx e~ xllal = 1 


Carrying out the Gaussian integral, we obtain 7 

1 


N = 


n a 


and therefore 


(6.60) 


(6.61) 


(x\f) = f(x) 


-x 1 /2a~ 


ix a 


The probability density 


f*(x)f(x) : 




Jn a 


(6.62) 


(6.63) 


is plotted in Fig. 6.3a. By changing the parameter a, we can adjust the width of the 
Gaussian and therefore how much the particle is localized. Compare Fig. 6.3b with 
6.3a. The limiting case of (6.63) as a -> 0 even provides us with a representation of 
the Dirac delta function: 


<5(jc) = lim .. e (6.64) 

a->0 fJT a 

In this limit the “function” is nonzero only at x = 0 and from (6.60) has unit area. 

The Gaussian provides us with a probability distribution that is mathematically 
“nice” in that the integrals in both position and momentum space are easy to evaluate. 
We can relate the constanta to the uncertainty Ax in the position of the particle. Since 

M 1 2,2 

(x) = I dx .—;. e x x = 0 (6.65) 

J —oo \J7t Cl 

because the integrand is an odd function of x, and 

/ OO 1 , , „2 

dx e~ x ' la 'x 2 = — (6.66) 

-co sfra 2 

we see that the uncertainty is 


A.r 



a 

Ti 


(6.67) 


7 See Appendix D for techniques to evaluate all the Gaussian integrals in this section. 
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(b) 


Figure 6.3 The probability densities (6.63) and (6.69) of a Gaussian wave 
packet in position space and momentum space, respectively. The value of a 
in (6.63) is 50 percent larger for the position-space probability density in (b) 
than for that in (a). 


We are now ready to determine the momentum-space wave function (p\\[f). 
Substituting the position-space wave function (6.62) into (6.57a), we obtain 

(p\f) = f dx —L= e - ipxlh --L= = I-?— e - p2al l 2Hl ( 6 . 68 ) 

S —oo s/2tcYi ^Oy/lC V V* 

Note that the Fourier transform of a Gaussian is another Gaussian. This is a useful 
result that we will take full advantage of in Chapter 8 . The probability of the particle 
having momentum between p and p + dp is given by 

\(p\f)\ 2 dp= e- p2 “ 2 ' h2 dp (6.69) 

n-sjix 

The momentum probability density | (p\f)\ 2 is shown in Fig. 6.3. We can now easily 
calculate 

(Px) = (f\Px\f) = f dp \{p\t)\ 2 P - ~ 7 = f dpe~ p " al/nl p- 0 (6.70) 
J fX\J7l J —oo 

and 

/ /*00 *r 2 

dp \(p\f)\ 2 p 2 = 7 ~t= / dpe- p2 ^ h2 p 2 =^ (6.71) 

h^jit oo 2a- 
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Thus 


A P X = J(P 2 X ) - ( Px ) 2 = (6.72) 

V2 a 

Notice from (6.67) and (6.72) that Ax A p x — hf 2. Thus the Gaussian wave function 
is the minimum uncertainty state. 


EXAMPLE 6.2 Use the results of Section 3.5 to prove that the wave func¬ 
tion for which Ax A p x = /i/2 must be a Gaussian. 

SOLUTION In Section 3.5 we established the general uncertainty relation 
AAAB > |(C)|/2 for observables A and B for which the corresponding 
operators satisfied the commutation relation [A, B] — iC. The derivation 
started with the Schwarz inequality 

{a\a){m>\(c<W)\ 2 

with |a) = (A — (A»|iA> and |/3) = (B - {B))\f. For AAAB = |(C)|/2 to 
hold, there are two requirements. One is that \p) = c\a), where c is a constant, 
so that the Schwarz inequality becomes an equality and {tjr\F\xlf} — 0, where 
F = 6 + 0 + with 0 = (A — (A))(B - (B ». 

We can simplify things a bit by positioning the origin so that (x) = 0 and 
choosing a frame of reference for which (p x ) =0. Then the two requirements 
become 

cx W = p x \f) 

and 

W (xp., + p x x)\f)=0 


Projecting the first equation into position space, we see that 

h 3(x|i (/) 


cx (x|i//) 


3x 


which has the solution 


(x\xfr) = e icx2/2ti 

The first requirement also shows us that 

{f\p x =C*{f\x 

Using these results in the second requirement, we see that 
(ip-\(xcx + c*xx)|^) = 0 
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or 

(c + c*)(^\x 2 \f) = 0 

Therefore c = — c*, indicating that c is purely imaginary, or c = i\c\. Thus 

{x\f)=e- Mxl / lh 

namely, a Gaussian. 


EXAMPLE 6.3 Calculate (p x ) using the position-space Gaussian wave 
function and compare your result with (6.70). 


SOLUTION 


/ 


h d 


(Px) = (f\Px\f) = / dx f*(x)- — i(r(x) 

i ox 


in 


J—og \JTCQ, 


~ x2 l a2 x = 0 


Similarly, you can calculate (p~) directly in position space and compare your 
result with (6.71). 


TIME EVOLUTION OF A FREE PARTICLE 

One of the advantages of knowing (p\ifr), in addition to being able to determine 
the probability that the momentum of the particle is in some range of momenta, is 
that we can use this amplitude to determine the time evolution of the state of a free 
particle. For a free particle 

> p 2 

H = ~ (6.73) 

2m 

Thus the momentum states | p) are also energy eigenstates. Therefore if we express 
the state of the particle as a superposition of momentum eigenstates in (6.47), we 
can work out how the state evolves in time: 

IlKO) = e~ iH,/n J dp \p)(p\if(0)} 

= f dpe-rt'/*** \ P) ( P wm 

= j dp \p)(p\f{0)) (6.74) 
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For the Gaussian wave packet (6.62), (p |#(0)> is given by (6.68) and thus f(x, t ) 
is given by 

fix, t) = (x| fit)) = I dp e ~'v lt l lmh (x\p)(p\f(0)) (6.75) 

If you are proficient at carrying out Gaussian integrals (see Appendix D), it is 
straightforward to show that 8 


f{x , t) 


y / \fn\a + ( iht/ma )] 


-x~ / {2a~[\+(iht/ma 2 )}} 


(6.76) 


Comparing f*(x, t)f(x, t) with its form (6.63) at t = 0, we see that the position 
uncertainty is given by 


Ax = 


-n/2 



n 2 t 2 \ 

m 2 a 4 ) 


1/2 


(6.77) 


EXAMPLE 6.4 Call T the time such that 

I *lI1 = 1 

I m 2 a 4 

This is the time necessary for significant wave packet spreading. Calculate 
T for (a) a macroscopic 1 g mass with a = 0.1 cm and (b) for an electron 
with a = 10~ 8 cm. 

SOLUTION (a) For a = 0.1 cm and m = Ig, then T — 10 25 s, which equals 
3 x 10 17 years and shows why we do not see macroscopic particles “spread.” 
(b) For an electron, however, with a = 1(C X cm (the size of an atom), we find 
T = 10“ 16 s, so that spreading is a very natural fact of life for a microscopic 
free particle. 9 * * Also notice for a particular particle that smaller a (and hence 
smaller initial uncertainty Ax in^the position of the particle) means more 
rapid wave packet spreading. 

Perhaps this is a good point to address a common misperception. Al¬ 
though a Gaussian wave function is the minimum uncertainty wave function, 
wave packet spreading does not mean that, since Ax increases with increas¬ 
ing time, A p decreases. In fact, as (6.74) shows, 

[p\f(t))=e-^ ,tlmh {p\fm 


8 Proficiency doing Gaussian integrals is a very useful skill, especially when we get to 
Chapter 8. You are thus encouraged to work out Problem 6.4. 

9 In case you are suddenly worried about the long-term survivability of atoms, remember that 

if an electron is bound within an atom in an energy eigenstate, it is in a stationary state, which 

does not spread out or change as time progresses. 
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; Since 

\{p\fm 2 = \(pW(0))\ 2 

the probability that the particle has momentum between p and p + dp does 
not vary with time, as might, at least in retrospect, be expected for a free 
particle, one without any forces acting on it. 


6.7 The Double-Slit Experiment 


Our analysis of the Gaussian wave packet has illustrated a number of features of the 
position-momentum uncertainty relation Ax A p x > H/2. As we noted earlier, this 
relation follows directly from the position-momentum commutation relation (6.31). 
By adjusting the width a of the Gaussian wave packet, we directly control the 
uncertainty in the position of the particle, as (6.67) shows. However, as we make 
the position-space wave packet broader by increasing a, the momentum-space wave 
function {p\ifr) becomes narrower [see (6.72)], maintaining the uncertainty relation 
(see Fig. 6.3). Of course, in the macroscopic world we never seem to notice that 
we cannot specify both the position and the momentum (or the velocity) of the 
particle with arbitrary precision. It is the smallness of H that protects our classical 
illusions. If a particle of mass 1 g is moving with a velocity of 1 cm/s and we 
specify its momentum to one part in a million, that is, A p x — 1CT 6 g- cm/s, then 
Ax ~ 1CT 21 cm, which is 1CT 8 times smaller than the radius of a proton. We would 
be hard pressed experimentally not to say the particle has a definite position. On 
the other hand, for an electron in an atom, with a typical velocity of 10 8 cm/s, the 
momentum p x ~ 10~ l9 g- cm/s, and even if we allow A p x to be as large as p x , 
we find Ax ~ 1CT 8 cm, which is roughly the size of the atom itself. Thus in the 
microscopic world the uncertainty clearly matters. 

We can see the importance of the Heisenberg uncertainty principle at a funda¬ 
mental level by examining the role it plays in the famous double-slit experiment. 
In this experiment, a beam of particles with a well-defined momentum is projected 
at an opaque screen with two narrow slits separated by a distance d, as shown in 
Fig. 6.4. Even if the intensity of the incident beam is so low that particles arrive at a 
distant detecting screen one at a time, when a sufficiently large number of particles 
have been counted, the intensity pattern on the screen is an interference pattern, with 
the location of the maxima satisfying d sin 9 = nk, where the wavelength A of the 
particles is given by (6.56). The classical physicist is mystified by this result, think¬ 
ing that surely a single particle passes through one slit or the other, and thus cannot 
understand how a particle can “interfere” with itself. The quantum physicist realizes 
that a single particle has an amplitude to reach any point on the detecting screen by 
taking two paths, one through the upper slit and one through the lower slit, and that 


Page 226 (metric system) 



6.7 The Double-Slit Experiment | 211 


x 



Detecting 

Screen 


Figure 6.4 The double-slit experiment. 


these amplitudes can interfere with each other to produce the double-slit intensity 
pattern. 10 

If the classical physicist challenges this view by actually observing through 
which slit the particle passes by using a microscope, like the one in Fig. 6.1, and 
shining light on the two slits, the uncertainty relation (6.37) guarantees that the 
interference pattern disappears. If we call the direction along the screens the jc 
direction, determining through which slit the particle passes requires an uncertainty 
Ax < d/2 in the electron’s position. This forces an uncertainty A p x > 2 h/d in 
the particle’s momentum and hence an uncertainty in the angular deflection of the 
particle A 9 = Ap x jp > (2 h/d)/(h/X) — 2X/d, which is of the same order as the 
angular spacing between interference maxima, wiping out the interference pattern. 11 
From this analysis, we can see the pivotal role the uncertainty principle plays in 
maintaining the logical consistency of quantum mechanics: in this experiment it 
keeps us from knowing which slit the particle goes through and at the same time 
observing an interference pattern. 12 


EXAMPLE 6.5 Helium atoms with a speed of 2.2 km/s are projected at 
a double-slit arrangement. See Fig. 6.5. Determine the spacing between 
interference maxima in the detection plane, which is located L — 1.95 m 
behind the slits. The separation between the slits is d — 8 /tm. 


10 We will justify this assertion more fully in Chapter 8. 

11 We are making only an order of magnitude estimate here, taking the right-hand side of the 
uncertainty relation to be of order /?., Planck’s constant. This has the advantage of freeing us from 
worrying about a detailed analysis of the position uncertainty associated with resolving which slit 
the particle went through and it keeps the algebra transparent. 

12 However, an experiment with highly excited rubidium atoms as the projectiles and micro- 
maser cavities in front of the slits as detectors shows that it is possible to determine through which 
slit the atom passes without changing the momentum of the atom, thereby evading the limitations 
imposed by the Heisenberg uncertainty principle. Nonetheless, such measurements also destroy 
the interference pattern. See M. O. Scully, B.-G. Englert. and H. Walther, Nature 351, 111 {1991). 
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Figure 6.5 Schematic representation of the double-slit experiment with helium atoms, 
including a gas reservoir N, electron impact excitation EE, collimating entrance slit A, 
double slit B, the detection plane C, and a secondary electron multiplier (SEM). As the 
helium atoms travel toward an entrance slit, which serves to collimate the beam, they 
are bombarded by electrons that have been fired along the beam direction. As a result of 
these collisions, some of the helium atoms are in excited states that are metastable, that is, 
states with unusually long lifetimes. An excited helium atom that strikes the SEM is very 
likely to be ionized; the SEM then generates an electronic pulse that can be amplified and 
counted, essentially allowing the measurement of single excited atoms. 


SOLUTION The wavelength of the helium atoms is given by 


X = ■ 


mv 


yj.yjj iu j ■ o 

(6.63 x 10~ 27 kg) (2.2 x lO 3 m/s) 


Maxima occur when the difference in path lengths between the two paths 
that helium atoms can take between the source and the detector is an integral 
number of wavelengths: 

d sin 6 = nX n — 0, ±1, ±2, . .. 

where 6 is shown in Fig. 6.4. Notice that Xjd = 5.6 x 10“ 6 , so the angles 
of deflection are very small and it is appropriate to make the approximation 
sin 6 = tan 6 = x/L, where x is the position of the maximum in the detection 
plane. Thus the distance between adjacent maxima is given by 


_ LX _ (1.95 m)(45 x l(T l2 m) 
“ d 8 x 10“ 6 m 


= 11x10 6 m=ll fim 


which is in good agreement with the observed separation (see Fig. 6.6). 13 


13 A detailed discussion of this experiment showing how the interference pattern builds up 
one atom at a time is given by J. S. Townsend, Quantum Physics: A Fundamental Approach to 
Modem Physics, University Science Books, Sausalito, CA (2010). The data from this helium atom 
interference experiment appear on the book’s cover. 
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Figure 6.6 The number of helium atoms detected vs. the position _v in the 
detection plane for atoms with speeds between 2.1 and 2.2 km/s (and therefore 
X ~ 45 pin). The horizontal dashed line shows the dark counts. This figure 
is from Ch. Kurtsiefer, T. Pfau, and J. Mlynek. private communication. See 
their article in Nature 386, 150 (1997). 


6.8 General Properties of Solutions to the Schrodinger Equation 
in Position Space 


So far, we have restricted our discussion of time evolution within one-dimensional 
wave mechanics to that of a free particle, for which the energy eigenstates are also 
momentum eigenstates. When the one-dimensional Hamiltonian, as given in (6.32), 
includes potential energy as well as kinetic energy, we start our analysis by projecting 
the equation of motion (4.8) into position space: 

.4k*. 

(x\H\x//(t)} =iti{x\^-\ir(t)) (6.78) 

dt 


Taking advantage of (6.42) and 


(jc|y(i) = (x|V(x) 


(6.79) 


we can write 


(x\H\if(t)) = (*| 


^ + V(x) 

2m 


Wit)) 


2m dx 2 


+ V(x) 


(xW(t)) 


(6.80) 
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Thus (6.78) can be expressed as 

+ V(x) ^{x,t) = ih^-f{x,t) (6.81) 

2m dx 2 J dt 

where, as in Section 6.1, we have identified 

(x\\ff (t)) = if (x, t) (6.82) 

as the wave function. Equation (6.81) is the time-dependent Schrodinger equation 
in position space. Note that we have replaced the total time derivative of the ket 
|i//(t)> in (6.78) with a partial time derivative of the wave function /f(x. t ) because 
we are only calculating how the wave function evolves in time on the right-hand side 
of (6.81). 

If we take the state | t/r(r)} in (6.78) to be an energy eigenstate, for which the time 
dependence is given by | E)e 1 , we can write the wave function as 

f E (x,t) = (x\E)e~ iEt/h (6.83) 


Substituting this form for an energy eigenfunction into (6.81), we obtain 

h 2 d 2 


2m dx 2 


+ V(x) 


(x\E) = E(x\E) 


(6.84) 


which is often referred to as the time-independent Schrodinger equation in posi¬ 
tion space. This equation also results from projecting the energy eigenvalue equation 


H\E) = E\E) 


(6.85a) 


into position space: 


{x\H\E) = E(x\E) 


(6.85b) 


It is common to write (x\E) — \j/ E (x). We will, however, drop the subscript h and 
implicitly assume for the remainder of this chapter that the wave function t/r (x) is an 
energy eigenfunction. Since we have factored out the time dependence, (x\E) is a 
function only of x and we can replace the partial derivatives in (6.84) with ordinary 
derivatives: 


2m dx 2 


+ V(x) 


fix) = EtJ/(x) 


( 6 . 86 ) 


Let’s first take a specific example to illustrate some of the features of the solutions 
to this differential equation. Suppose that the potential energy V (x) is the finite 
square well 


j 0 |x| < a/2 

I V Q \x j > a/2 


(6.87) 
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V 

-Vo 


-all 0 all Figure 6.7 A finite square well. 


as shown in Fig. 6.7. For this particularly simple potential energy, which is piecewise 
constant, we can solve (6.86) analytically in the different regions: namely, inside the 
well (jx| < a/2) and outside the well (both x < —a/2 and x > a/2). We will restrict 
our attention in this section to solutions with energy 0 < E < V 0 . A classical particle 
would be bound strictly inside the well with this energy, since outside the well the 
potential energy would be greater than the energy, which classically would mean 
negative kinetic energy. 14 

The differential equation (6.86) can be expressed as 


drf_ 
dx 2 
d 2 f 
dx 2 


2m E o 

.—-f = -k z f [x| < a/2 
2m (E - V„) 


h 2 


= q/'j/z |jc | > a/2 


Note that k 2 and q 2 are positive constants: 



( 6 . 88 ) 

(6.89) 


(6.90) 


According to (6.88) and (6.89), two derivatives of the wave function yield just a 
constant times the wave function; thus it is especially straightforward to solve these 
differential equations. In particular, within the well, where differentiating twice gives 
a negative constant times the wave function, the solutions can be written as 

fix) — A sin kx + B cos kx |jc|<a/2 (6.92) 


while outside the well, where differentiating twice gives a positive constant times 
the wave function, the solutions are 


f(x) = Ce qx + De- qx \x\ > a/2 (6.93) 


14 We will discuss the unbound solutions to equations such as (6.86) in Section 6.10. 
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Figure 6.8 A schematic diagram of energy eigenfunctions of 
the finite square well for three different energies: an energy 
E < Ei, where E\ is the ground-state energy, E = E h and 
E > Only for E = E x is the wave function normalizable. 


Actually, since the solutions should satisfy the normalization condition (6.12), we 
must examine separately the regions x < —a/2 and x > a/2 and discard the expo¬ 
nential that blows up in each of these regions. Therefore, 

f(x) = Ce qx x < —a/2 (6.94) 

f(x) = De~ qx x > a/2 (6.95) 


Thus we see that the solution oscillates inside the well, where E > V, and is 
exponentially damped outside the well, where E < V. 

Since we are seeking a solution to a second-order differential equation, the 
different functions in the three regions must join up smoothly, that is, they must 
be continuous (so that the first derivative is well defined) and have a continuous first 
derivative everywhere. This condition on the continuity of the derivative follows 
directly from “integrating” the Schrodinger equation: 


J Jt) „(A) = f x+ ' dx ± d ± 

dx / x+s \ dx ) x _q Jx—s dx dx 


f x+£ 2m 

L dx y 


E)xfr (6.96) 


since the right-hand side vanishes for well-behaved wave functions in the limit e -* 0 
unless the potential energy V is infinite. We will see an example where the derivative 
is indeed discontinuous in the next section, when we consider the infinite potential 
well. 15 

If we start sketching from the left a bound-state wave function for the finite poten¬ 
tial well, we see an exponential that rises as x increases (Fig. 6.8). At the boundary 
of the potential well, this exponential (6.94) must match up with the oscillatory 
solution (6.92), with the wave function being continuous and having a continuous 
derivative across the boundary. This oscillating function must then join smoothly 


15 Also, see Problem 6.19. 
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Figure 6.9 (a) The energies and (b) the corresponding energy eigenfunctions for a finite 
square well with four bound states. 


onto a damped exponential (6.95) at the x = a/2 boundary. This turns out to be a 
nontrivial accomplishment: only for special values of the energy will this matching 
be possible. Otherwise, the oscillatory function will join onto a combination of rising 
and damped exponentials, with the rising exponential blowing up as x oo. This 
effect is readily seen if you integrate the Schrodinger equation (6.86) numerically. 16 
Figure 6.9 shows the energies and corresponding eigenfunctions for a finite square 
well that admits four bound states. If you are interested in examining how to deter¬ 
mine analytically the allowed values of the energy for the particular potential energy 
well (6.87), turn to Section 10.3, where this calculation is carried out for the three- 
dimensional spherically symmetric square well; the mathematics is essentially the 


16 A numerical solution of the Schrodinger equation for a square well potential is discussed by 
R. Eisberg and R. Resnick, Quantum Physics of Atoms, Molecules, Solids, Nuclei, and Particles, 
2nd ed., Wiley, New York, 1985, Appendix G. 
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same as for solving the one-dimensional well. 17 In the next section we will examine 
how this quantization of the energy arises in a particularly simple example in which 
we let V 0 -> oo. 

Finally, we should note here that this combination of oscillatory and exponential¬ 
like behavior of the energy eigenfunction depending on whether the energy is greater 
than or less than the potential energy, respectively, is generally true, even when the 
potential energy is not a constant. For example, if V = V(x), then when E > V(x) 
we can write 


= -^r[£ - ' v (x)]f = ~k 2 {x)if (6.97) 

Since k is not a constant here, we cannot immediately write down the solution as 
in (6.92). Flowever, note that if f > 0, then d 2 xj//dx 2 < 0. Thus if the wave function 
is positive, the second derivative is negative; that is, the function is concave down. 
It must therefore bend back toward the axis. Similarly, if iff < 0, then d 2 xj//dx 2 > 0. 
Thus if the wave function is negative, the second derivative is positive and the 
function is concave up. In either case the function bends back toward the axis in 
an oscillatory manner. Also note for a particular value of V(x) that the magnitude 
of the energy determines how rapidly the wave function oscillates. The larger the 
energy, the larger the value of k 2 (x), and the more rapidly the function bends back 
toward the axis. Thus the lower energy eigenfunctions have the smaller curvature 
and, consequently, a smaller number of nodes. You can see this pattern in the energy 
eigenfunctions of the finite square well shown in Fig. 6.9. 

In a region in which V (x) > E, on the other hand, 

d 2 xh 2m 

= -^[V(jr) - E ]t// = q 2 {x)xf, (6.98) 

Here, if the wave function t/r is positive, the second derivative is positive as well, 
and the function is concave up; it bends away from the axis. We call such a solution 
an exponential-like solution. A similar bending away from the axis is seen if xfr is 
negative. Thus there can be no physically meaningful solutions for which V(x) > E 
everywhere, for then the wave function must eventually diverge. However, as long as 
there is some region for which E > V(x), the exponential-like solution “turns over” 
into an oscillatory-type solution, and the wave function need not diverge. Notice that 
as we move from a region in which E < V(x) to a region in which E > V (.r). we 
pass a value for x such that V(x) = E, at which point the second derivative vanishes. 


17 The major difference between the one- and three-dimensional problems is that in three 
dimensions the variable r = -/v 2 + y 2 + z 2 replaces x. Clearly, r cannot be less than zero, and 
in fact there is a boundary condition that eliminates the cosine term of the one-dimensional 
solution (6.92). 
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Thus there is a point of inflection where the curvature changes as we move between 
the two regions. 

These quite general characteristics of the energy eigenfunctions make it possible 
to sketch them in a rough way without actually solving the Schrodinger equation. 
In the general case in which V depends on x in other than a piecewise-constant 
manner, the eigenfunctions are not sines and cosines or exponentials. However, 
the eigenfunctions will still look roughly the same, exhibiting oscillatory behavior 
in regions in which E > V(x) and exponential-like behavior in regions in which 
E < V (x). A nice example to which we will turn in Chapter 7 is the harmonic 
oscillator, for which V (.v) = ' morx 2 . Some of the energy eigenfunctions in position 
space for the harmonic oscillator are shown in Fig. 7.7. 

6.9 The Particle in a Box 


A particularly easy but instructive energy eigenvalue equation to solve directly in 
position space is the one-dimensional infinite potential energy well 


VU) = 


0 |jc | < a/2 
oo |,v| > a/2 


(6.99) 


which is shown in Fig. 6.10a. 18 Outside the well, the energy eigenfunction must van¬ 
ish, as can be seen by examining the limit as V 0 -> oo for the wave functions (6.94) 
and (6.95) for the finite well. As for the finite well, the most general solution to the 
differential equation (6.88) inside the well is given by 


rjr(x) — A sin kx + B cos kx |x| < a/2 (6.100) 


Since the wave function vanishes outside the well, the requirement that the wave 
function be continuous dictates that 

.-w ■ 

x/r ( — ^ = A sin — + B cos — = 0 (6.101a) 

\ 2 / 2 2 


and 


i jr 



, . —ka r , — ka 

= A sin-h B cos- 

2 2 


. • ka _ ka 

= — A sin-I - B cos —- = 0 

2 2 


(6.101b) 


18 Mathematically, it is even easier if we choose our origin of coordinates to be at one edge of 
the box. See Problem 6.13. 
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Figure 6.10 (a) The infinite potential energy well with the lowest four allowed energies 
and (b) the corresponding energy eigenfunctions. This potential well possesses an infinite 
number of bound states. 


These two equations can be expressed in matrix form as 


/ sin(Ta/2) cos(ka/2) \ ( A \ 

= 0 ( 6 . 102 ) 

\ — sm(ka/2) cos(ka/2) / \ B J 

For a nontrivial solution to this set of homogeneous equations in the two un¬ 
knowns A and B, we must demand that the determinant of the coefficients vanishes: 


or simply 


sin(Ta/2) cos(ka/2) 
— sin(ka/2) cos(ka/2) 


. . ka 
2 sin — cos 
2 


ka 

~2 


— sin ka = 0 


This equation is satisfied for 


k n a = nn n = integer 


(6.103) 


(6.104) 


(6.105) 
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where we have put a subscript n on the k that is specified by the particular integer n. 
For n = 1, 3, 5, , then k n a/2 = n/2, in/2, 5n/2, , and cos(k n a/2) = 0. 

Substituting this result into (6.102), we see that A = 0 and therefore 

xfr n (x) = B n cos n = 1, 3, 5, ... (6.106a) 

a 

Forn = 2, 4, 6, ..., then A:„a/2 = 7r, 277, 3:71, . . ., andsin(ic„a/2) = 0. Substituting 
this result into (6.102), we find that B — 0 and therefore 

njr Y 

t/f n (x) = A„ sin- n — 2, 4, 6, ... (6.106b) 

a 

We can determine the constants A n and B n by imposing the normalization con¬ 
dition (6.12), namely. 


/ 


1= / dx \l/*(x)f n (x) = 


f 


‘a/2 

nnx 

dx B*B„ cos 2 

n n 


—a/2 

a 

'a/2 

nnx 

dx A* A n sin 2 


-a/2 

a 


n = 1, 3, 5, ... 


(6.107) 


n = 2, 4, 6, . .. 


Up to an overall phase, this tells us that A n = B n = s/2/a and therefore 


f„(x) 


[2 nnx 

- cos -— — n — 1, 3, 5, ... 
V £7 a 

2 . nnx . . . 

sin- n = 2, 4, 6, ... 

a a 


\x\<a/2 (6.108) 


Note that we have not included the n = 0 solution because for n = 0, t// = 0, corre¬ 
sponding to no particle in the well. Also note that the negative integers in (6.105) 
merely change the wave functions (6.106b) into the negative of themsel ves, corre¬ 
sponding to just an overall phase change for these states and not to different states 
themselves. 

In addition to labeling the energy eigenfunctions (shown in Fig. 6.10), the quan¬ 
tum number n specifies the corresponding energies. Since 


k n a = 


12m E n 


= nn 


n = 1, 2, 3, ... 


(6.109) 


we have 

h 2 n 2 n 2 

T n = 1,2,3,... (6.110) 

2ma l 

For the particle in the box, it is especially easy to see why only discrete energies are 
permitted. The requirement that the wave functions vanish at the boundaries of the 
box means that we can fit in only those waves with nodes at x - ±a/2. 
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Figure 6.11 The ground-state energy eigenfunction for the small box of 
width a (solid line) and the ground-state and second-excited-state energy 
eigenfunctions of the bigger box of width 2 a (dashed lines). 


EXAMPLE 6.6 Suppose that a measurement of the energy is carried out 
on a particle in the box and that the ground-state energy = h 2 Tt 2 /2ma 2 is 
obtained. We then know that the state of the particle is the ground state, with 
energy eigenfunction {x\ E x = h 2 7i 2 /2ma 2 ) = f x (x). What if we now change 
the potential energy well that is confining the particle and pull the walls of 
the well out rapidly so they are positioned at .r = ±a instead of x = ±a/ 2? 
In fact, we imagine pulling the walls out so rapidly that instantaneously the 
state of the particle doesn’t change. As can be readily seen by comparing 
the wave function of the particle in this state with the energy eigenfunctions 
of the new, larger potential well (see Fig. 6.11), the particle is no longer in 
an energy eigenstate. Thus we can ask, for example, what the probability 
is that a subsequent measurement of the energy of the particle will yield 
a particular energy eigenvalue such as the ground-state energy of the new 
well. 19 

SOLUTION If we call the initial state | i) and the final state |/), the am¬ 
plitude to find a particle in the state | i) in the state \ f) is (f\i). Since we 
have already calculated the position-space wave functions, it is convenient, 
as noted in Example 6.1, to calculate the amplitude (f\i) by inserting a com¬ 
plete set of position states between the bra and the ket: 


19 For a more physical example, see Problem 10.7. 
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(f\i) = j dx (f\x){x\i) 

The amplitude {jc|* ), where the initial state is the ground state of the well of 
width a (|i) = If* 111 ' 110 }), is given by 


{x\Ef d ' ha ) = ff dtha 


(x) = 



0 \x\> a/2 


while the amplitude (f\x), where the final state is the ground state of the 
well with width 2a(\f) = ]£] vidth 2a )), is given by 


/ rp width 2 a 


I*) = 



|jc | < a 
|x| > a 


Thus 

(£f dth2fl |£] vldtha > = j dx {£■ W'dth 2a \x)(x\E " idth “) 



where the integrand is nonzero only for |x| < a/2 because (Ej vidth a \x] is 
nonzero only in this region. Thus the probability of finding the particle in 
the ground state of the bigger well is 


| ^ g width 2 a | j? width a j |~ 


64 
9t r 2 


— 0.72 


In this way we could go on to calculate the probability of finding the 
particle in the other energy eigenstates of the bigger well. The form of the 
energy eigenfunction for the n =;-3 state of the bigger well, as shown in 
Fig. 6 .11, suggests that there is a significant overlap of the wave functions 
width and ^width 2 a^ an( j t ^ us t | iere should be a significant probability of 
finding the particle in this n = 3 state as well (see Problem 6.11). On the 
other hand, we can also quickly see that there is zero amplitude of finding 
the particle in the even n states. Since (x|£} vidtha ) is an even function of 
x \i/r n (— x) — i jr n (x) for n odd] while (£W' dth 2a \x) for n even is an odd 
function of x ( 1/4 (—x) = —t//„ (x ) for n even], the product of an even and an 
odd function is of course an odd function, which vanishes when integrated 
from —a/2 to a/2. The evenness or oddness of the energy eigenfunctions, 
often referred to as their parity, turns out to be a general characteristic of the 
eigenfunctions of the Hamiltonian when the potential energy is even, that is, 
V (—x ) = V(x). We will discuss the reason for this more fully in Chapter 7. 
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Finally, we can ask how the system evolves in time after the walls of the 
potential energy well have been pulled out. Since the system is no longer in 
an energy eigenstate, it is no longer a stationary state, and thus can exhibit 
interesting time dependence. Since the initial state at t = 0 can be written as 
a superposition of the energy eigenstates: 


IVKO)) = |E] widthfl > = £ |£ n width2a )(E" 


width 2 a | g width a ^ 


then 


^ width la\ I r-width 2 a i £■ width «\ 


\n*))= e - iHtlh Y,\ E n 

n 

_ t£* idth2a //ft | g width 2a width 2a | £ width a ^ 


and therefore 


{x\^(t)) = Y j e ~ ihnV ' ,l ' ima2 

n 

-E«" 


width 2a \ / rrwidth 2a i cwidth a\ 


^ | gwtuin za ^ gwiuin za | j? 
hn 2 n 2 t/%ma 2 ^ width 2a / / t-width 2a\ fwidthu 


'1 


Once the amplitudes (£* KItn la \E ] 


width 2 a i cwidth jj ave b een calculated, it is probably 
best to carry out the sum numerically to see how the wave function evolves 
in time. 


6.10 Scattering in One Dimension 


Let’s turn our attention to solutions of the Schrodinger equation for energies such 
that the particle is not bound, or confined, in a potential well. For example, for the 
potential energy well (6.87) we consider solutions with E > V 0 , which are oscillatory 
everywhere, just like the momentum eigenfunctions (6.54). As for the momentum 
states, we will see in explicitly solving the Schrodinger equation that the energy 
eigenvalues take on a continuum of values in these cases. As for the free particle 
that we treated in Section 6.6, the way to generate physically acceptable states 
is to superpose these continuum energy solutions to form a wave packet. Such a 
wave packet will exhibit time dependence. We can form a wave packet that is, for 
example, initially localized far from the potential well and then propagates to the 
right, eventually interacting with the well and producing nonzero amplitudes for the 
wave packet to be both transmitted and reflected, as shown in Fig. 6.12. This is a 
typical scattering experiment in which particles are projected at a target, interact with 
the target, and are scattered. Scattering in one dimension is relatively straightforward 
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(b) 


Figure 6.12 A schematic diagram showing (a) a wave packet 
incident on a potential energy barrier and (b) the reflected 
and transmitted waves. 


because the only options for the particle are reflection or transmission. Although the 
right way to analyze scattering is in terms of wave packets, 20 often the wave packet 
is sharply peaked at a particular value of the energy and thus sufficiently broad in 
position space in comparison with the size of the region over which the potential 
energy varies that we can treat it as a plane wave when analyzing the scattering. This 
turns out to be a big simplification, but it raises the question: How do we calculate 
the probabilities of reflection and transmission when we are dealing with an energy 
eigenstate that is a stationary state and thus doesn’t show any time dependence? The 
answer is that we can think of scattering in terms of a steady-state situation in which 
particles are being continually projected at the target; some of this incident flux is 
reflected and some of the flux is transmitted. We can relate this flux of particles to a 
probability current that is needed to ensure local conservation of probability. 


THE PROBABILITY CURRENT 

To see how this probability current arises, consider 3(i jj*ip)/dt, the time rate of 
change of the probability density. Using the time-dependent Schrddinger equa¬ 
tion (6.81), we see that the time derivative of the wave function is given by 


drjf(x, t) 1 
~dt ~ ih 


h 2 d 2 f(x, t) 
2m dx 2 


+ V(x)f(x, t) 


( 6 . 111 ) 


Therefore 


9i ff*(x, t) _ 1 

dt ih 


h 2 d 2 f/*(x, t ) 
2m dx 2 


+ V(x)f*(x, t) 


( 6 . 112 ) 


20 For a discussion of one-dimensional scattering in terms of wave packets, see R. Shankar, 
Principles of Quantum Mechanics, 2 nd edition. Plenum Press, New York, 1994. Section 5.4. 
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and 


~di~ 


iff 


dt dt 

ifr 

h 2 d 2 f* 

ih 

2m dx 2 

iff 

/ h 2 d 2 ^ 

ih 

\ 2m dx 2 


+ V (x)x(r 
+ f 


+ 


f* 

ih 


h 2 d 2 \lr 
2m dx 2 


+ V(x)iff 


h 2 9 2 i (r 


if) \ 2m dx 2 


This equation can be expressed in the form 

I) )j)’ t// 


dt 


dJx 

dx 


where 


h 

2m i 


r d -i- - 

dx v dx 


(6.113) 


(6.114) 


(6.115) 


where j x is called the probability current. This is just the form that we expect for 
a local conservation law. 21 For example, if we integrate (6.114) between x = a and 
,r = b, we obtain 


d f b r * 

J t j a dx * * = ~Jx (*. 0 + j x t) (6.116) 

If the probability of finding the particle between a and b increases, it does so because 
of a net probability current flowing into the region, either at a [positive current flows 
in the positive a direction and hence j x (a, t) > 0 means inward flow at a] or at b 
[negative current means current in the negative x direction and hence j x (b, t) < 0 
means inward flow at b]. See Fig. 6.13, Thus the probability in a region of space 
increases or decreases because there is a net probability flow into or out of that 
region. 


A POTENTIAL STEP 

Let s take the particularly simple example of scattering from the potential energy 
step 


V(x) = 


0 x < 0 
F 0 x > 0 


(6.117) 


In three dimensions, local conservation of charge is contained in the relation 

3p 


dt 


+ v.j = o 


where p is the charge density. A similar relation holds for the probability density in three 
sions. See Section 13.1 


dimen- 
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j x (a, t) > 0 


jjb,t)<0 


Figure 6.13 The probability of the particle being 
in the one-dimensional region between a and b 
increases with probability flowing into the region 
either at a or at b. 


V 


Vo 


0 


x 


Figure 6.14 A step potential. 


shown in Fig. 6.14 to illustrate how we relate the probability current to the probability 
of reflection and transmission. We wish to determine the energy eigenstates, for 
which 


ir E (x, t) = <//(.v)e il:t/h 
where i p(x) satisfies (6.86). To the left of the barrier 

fr d 2 1 // (.v) 


2m dx 2 


Etfr(x) x < 0 


which has solutions 


f(x) = Ae ikx + Be~ ikx x < 0 


(6.118) 


(6.119) 


( 6 . 120 ) 


with 


k = 



( 6 . 121 ) 


Notice that we have chosen to write the oscillatory solutions of (6.119) in terms 
of complex exponentials instead of sines and cosines, as in (6.92). The reason 
becomes apparent when we evaluate the probability current (6.115) for the wave 
function (6.120): 


Jx 


— (\A\ 2 ~\B\ 2 ) x<0 

m 


( 6 . 122 ) 
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We thus can identify 


UK , , ,9 

Jinc = —\A\ 2 (6.123) 

m 

as the probability current incident on the barrier from the left and 


kk 2 

J tei =— \B\ 2 (6.124) 

m 

as the magnitude of the probability current reflected from the barrier, showing that 
the probability of reflection is given by 


Jre f _ \B? 

■Zinc Ml 2 


(6.125) 


The probability of transmission for this scattering experiment is given by 


T = 


itrans 

,/inc 


(6.126) 


where j trails is the probability current to the right of the step. In order to evaluate R 
and 7', we need to solve for the wave function for x > 0 and then satisfy the boundary 
conditions at x = 0. The wave equation to the right of the step is given by 


d 2 f(x) 
dx 2 


= ^<Vb- E)t(x) 


x > 0 


(6.127) 


We consider two cases. 


Case 1 : E > V 0 

Since the energy is greater than the potential energy for x >0, the solutions to (6.127) 
are given by 


fix) = Ce ik ° K + De~ ik ° x x >0 (6.128) 


where 


___ 

W/^ (£ ~ Vo) (6A29) 

The D term generates a probability current flowing to the left for x > 0. Such a 
term would be generated physically if the experiment in question involved projecting 
particles at the potential step from the right. Clearly, the solutions to the differential 
equation should permit this possibility. However, if we restrict our attention to an 
“experiment’' in which particles are incident on the potential step only from the left, 
we are free to set D = 0 in (6.128). In this case 

f{x) = Ce ikt > % x > 0 (6.130) 
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and substituting the wave function (6.130) into (6.115), we find 


flk, 


itrans 


-|C | 2 


m 


Thus for the step potential the transmission coefficient is given by 

k () |C | 2 


T = 


k | Aj 2 


(6.131) 


(6.132) 


Let’s determine the reflection and transmission coefficients in terms of V 0 and E. 
In passing from the region to the left of the step to the region to the right of the step, 
we must require that the energy eigenfunction be continuous and have a continuous 
first derivative: 


A + B = C 
ik(A — B) = ik Q C 


which yield 


C = 


2k 


k -j- kt 


A and B 


k — k t 


o 


o 


k + kn 


(6.133) 


(6.134) 


Note that we have satisfied the boundary conditions for any value of the energy. 
Therefore the allowed energies do take on a continuum of values. Using (6.125) 
and (6.132), we find 


R 


(k - k {) ) 2 
(k + k 0 ) 2 


4 kkn 


C k + k 0 f 


(6.135) 


Note that 


R + T = \ (6.136) 

as it must for probability to be conserved. 

Case 2: E < V 0 

Here the solution for x < 0 is the same as ( 6 .120), but for jc > 0 we have 

^ = f? ( V o - E ) * = Jf > 0 (6.137) 

with q 2 > 0. Now we must choose the solution 

iff = Ce~ qx (6.138) 

since the increasing exponential would cause the wave function to blow up as 
x -> oo. Rather than match the wave function at the x = 0 boundary again, a 
comparison of the wave function (6.138) with (6.130) shows that we can obtain 
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the solution for E < V () from the solution for E > V 0 by the transcription ik f) —q. 
Thus from (6.125) and (6.134), we obtain for E < V 0 


\k - iq| 2 _ k 2 + q 2 
\k + iq\ 2 k 2 + q 2 


(6.139) 


Conservation of probability requires that the transmission coefficient must vanish 
for E < V 0 even though 


C = 


2k 

k + iq 


A^O 


(6.140) 


and the wave function penetrates into the potential energy barrier. Note that we can¬ 
not just make the transcription ik 0 —q for the transmission coefficient in (6.135) 
because, unlike the reflection coefficient, which is given by (6.125) whether the 
energy is greater than or less than the height of the barrier, the transmission coef¬ 
ficient is determined by the probability current for x > 0, and when the argument 
of the exponential in the wave function is real, / trans = 0. This emphasizes that the 
transmission coefficient is given by (6.126), and not by (6.132) in general. 


TUNNELING 


Suppose that we consider particles with energy E < V 0 incident on a potential energy 
barrier of height V 0 , but this time we chop off the end of the barrier so that 


V = 


0 X < 0 

V 0 0 < x < a 

0 x > a 


(6.141) 


as shown in Fig. 6.15. Now the energy eigenfunction is given by 


i>(x) = 


Ae' kx + Be~ lkx x < 0 
Fe qx + Ge~« x 0 <x <a 
Ce ikx x > a 


(6.142) 


with k and q given by (6.90) and (6.91), respectively. Note that both the rising and the 
falling exponential appear as part of the solution for 0 < x < a because the barrier 
is of finite width a and therefore the rising exponential cannot diverge. 

The procedure for determining the transmission coefficient is straightforward, 
if somewhat laborious. Satisfying the boundary conditions on the continuity of the 
wave function and its first derivative leads to four equations: 


A+B=F+G 
ik(A - B) = q(F - G) 

Fe qa + Ge~ qa = Ce ika 
q(Fe qa - Ge~ qa ) = ikCe ika 


(6.143) 
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V 


Vo 


Figure 6.15 A square potential barrier. 


The unknowns B, F , and G can be eliminated from these equations, yielding 

T _ Jx>a _ ( hk/m)\C\ 2 = _ 

jinc ( hk/m)\A\ 2 

1 + 

for the probability that the particle will tunnel through the potential barrier. For 
typical microscopic parameters such as an electron with 5 eV of kinetic energy 
tunneling through a barrier 10 eV high and 0.53 A wide (the Bohr radius), the 
transmission probability is 0.68. Thus tunneling is a common occurrence on the 
microscopic level. 

A useful limiting case of the transmission coefficient occurs for ga » 1. In this 
case 


k 2 + q 2 
2k q 


(6.144) 


sinh 2 qa 


pQa - p-qa e q a 

sinh qa = ----» — 

2 2 


and 


\*k~ + q“- 


2 

e~ 2qa 


(6.145) 


(6.146) 


We can quickly see why tunneling is not a common macroscopic occurrence if we 
plug in some typical macroscopic parameters such as V 0 — E = 1 erg, a — 1 cm, and 
m — 1 g. Then qa ~ 10 27 , so that T ~ e~ l0 ~ , an incredibly small number. 


EXAMPLE 6.7 Determine the transmission coefficient for a particle of 
mass m projected with energy E = V 0 from the left at the potential energy 
barrier 


V(x) = 


0 x < 0 

V' 0 0 < x < a 

0 x > a 
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V 



Figure 6.16 A potential energy barrier of height V 0 
for scattering of a particle with E = V 0 . 


Sec Fig. 6.16. Check that your result behaves appropriately in the limit 
a —» 0. 

SOLUTION First, we write the most general solution to the time- 
independent Schrodinger equation as 


fix) = 


Ae ikx + B<T m x <0 
C + Dx 0 < x < a 

Fe ,kx x > a 


Note: The most general solution to a second-order differential equation has 
two arbitrary constants. This is true in each of the regions x < 0,0 < x < a, 
and x > a. 

Continuity of the wave function and its derivative at x = 0 and x = a 
leads to 


A + B = C 

ik(A — B) = D or A — B = — 

ik 

C + Dci = Fe ika 
D = ikFe ika 


Adding the first two equations yields 


2 A = C + 


D 

ik 


The third equation tells us that 

C = Fe ika - Da = Fe ika (1 - ika) 
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j where we have used the last of the four equations that result from applying 
the boundary conditions in the final step. Thus we now have C and D in 
terms of F and hence 

I 2A = Fe ika (2 - ika) 

or 

| F e~ ika 

I A 1 — ika/2 

Therefore 

| F |2 gika g—ika j 

|A| 2 1 + ika/2 1 — ika/2 l + k 2 a 2 /4 

As a check, note T -» 1 as a -*■ 0, in which case the barrier disappears. Also 
note that T -> 0 as a —> oo. 

This example illustrates the general strategy for reducing the four equa¬ 
tions with the five unknowns—in this case, A, B, C , D, and F —that result 
from satisfying the boundary conditions that the wave function is continuous 
with a continuous derivative to one equation that can be used to determine 
the transmission coefficient. 


A NON-SQUARE BARRIER 

Even on the microscopic level, there are many situations where qa is sufficiently 
large that we can take advantage of the approxi mation (6.146) for the transmission 
coefficient. Notice that if we evaluate the natural log of the transmission coefficient 
(6.146), we find 

In T —v In ( , ) - 2qa -* - 2 qa (6.147) 

where we have dropped the logarithm relative to qa since ln(almost anything) is 
not very large. In the limit that (6.147) is a good approximation, we can use it to 
calculate the probability of transmission through a non-square barrier, such as that 
depicted in Fig. 6.17. When we include only the exponential term in (6.146), the 
probability of transmission through a barrier of width 2 a is just the product of the 
individual transmission coefficients for two barriers of width a. Thus, if the barrier 
is sufficiently smooth so that we can approximate it by a series of square barriers 
(each of width Ax) that are not too thin for (6.147) to hold, then for the barrier as a 
whole 


In T ~ In Y[ T t = In 7' ( ~ -2]T q, Ax 
i i i 


(6.148) 
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V 



Figure 6.17 A non-square barrier can be approximated by a sequence of square barriers 
if the potential energy V (x) does not vary too rapidly with position. 


If we now assume that we can approximate this last term as an integral, we find 


T ~ exp 2^ q t Ax 

where the integration is over the region for which the square root is real. You 
may have a somewhat queasy feeling about the derivation of (6.149). Clearly, the 
approximations we have made break down near the turning points, where E = V(x). 
Nonetheless, a more detailed treatment using the WKB approximation shows that 
(6.149) works reasonably well. 22 As an example, we can use it to estimate the 
currents generated by field emission for a metal (see Problem 6.25). 


~ exp 


H 


dx 


2m. 

Id 


[V(x)-£] (6.149) 


6.11 Summary 


hi this chapter we have turned our attention to variables such as position and 
momentum that take on a continuum of values, instead of the discrete set of values 
characteristic of variables like angular momentum. Thus instead of expressing a ket 
ly'/} as a discrete sum of eigenstates as in (1.33), we write it as 

W) — J da \a){a\ijr) (6.150) 

where the ket \a) is an eigenket of the operator A corresponding to the observable A: 

A\a)—a\a) (6.151) 


22 The WKB approximation and its application to tunneling:is discussed by L. Schiff, Quantum 
Mechanics, 3rd ed., McGraw-Hill, New York, 1968, Chapter 8, Section 34. 
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From (6.150) we see that the identity operator is given by 

j da \a){a\ = 1 (6.152) 

Substituting | a') for \ift) in (6.150), we find that the states of a continuous variable 
satisfy the normalization condition 

{a\a') = S{a -a') (6.153) 

where S(a — a') is a Dirac delta function. On the other hand, a physical state |i j/) 
satisfies 

1 = (i fr\i(r) = j da {ij/\a)(a\xl/) — j da |{a|i/r)| 2 (6.154) 

indicating that we should identify 


da\(aW)\ 2 (6.155) 

as the probability of finding the variable A in the range between a and a + da if a 
measurement of A is carried out. 

We have restricted our attention in this chapter to one-dimensional position states, 
for which 


„v|.y) = x\x) (6.156) 

and one-dimensional momentum states, for which 

p x \p) = p\p) (6.157) 

Just as angular momentum made its appearance in Chapter 3 in the form of an 
operator that generated rotations and energy entered in Chapter 4 in the form of 
an operator that generated time translations, here linear momentum enters in the 
form of an operator that generates translations in space. The translation operator is 
given by 

f (a) = e~ i3 ^ h (6.158) 

where the action of the translation operator on a position ket Lr) is given by 

f (a)|jc) = \x + a) (6.159) 

In order for probability to be conserved under translations, the translation operator 
must be unitary: 

T'(a)T(a) = e'^'/Y ’^ a ' h = 1 (6.160) 
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and therefore the linear momentum operator must be Hermitian: 


P1 = P* ( 6 - 161 ) 

A consequence of the momentum operator being the generator of translations 
is that the position and momentum operators do not commute [compare the equa¬ 
tion T(a)x\x) = x\x + a) with xT{a)\x) — (x + a)|jc + a) | but rather satisfy the 
commutation relation 


[x, p x ]-ih 


(6.162) 


leading to the Heisenberg uncertainty relation 

. . h 

&xA Px > - 


(6.163) 


A further consequence is that the action of the momentum operator p x in position 
space is given by 

Jr Q 

{x\p x \f) ~- — (x\f) (6.164) 

i dx 

and therefore 

f°° ft a 

(Px) = (flPxW) = / dx {\jf\x)-—-{x\f) (6.165) 

J-oo 1 dx 


Equation (6.164) indicates that in position space we can represent the momentum 
operator by a differential operator 


n a 


x basis i dx 


Thus the equation of motion 


(6.166) 


for the Hamiltonian 


becomes in position space 


H\f(t)) =ift—\xfr{t)) 
dt 


H = — + V(x) 
2m 


( 6 . 167 ) 


(6.168) 




ft 2 9 2 


+ V (x ) 


(x\f (t)) = ift^(x\xl/(t)) (6.169) 
dt 


2m dx 2 

the usual time-dependent Schrodinger equation. We make the identification 


(x\x//(t)) = f(x, t ) 


( 6 . 170 ) 
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since the amplitude to find a particle in the state jt//(/)} at the position x is just what 
we usually call the wave function in position space. Thus, according to (6.155), 
dx | ir(x, t)| 2 is the probability of finding the particle between x and x + dx, the 
usual Bom interpretation of wave mechanics. 

The energy eigenvalue equation 

HW) = E\f) (6.171) 


in position space becomes 


or simply 


hh 2 __ 

2m dx 2 


+ V(jc) 


(x\E) = E(x\E) 


(6.172a) 


k 2 a 2 


f E {x) = Ef E {x) 


(6.172b) 


This differential equation can be solved to determine the energy eigenstates for one¬ 
dimensional potentials. See Sections 6.8 through 6.10. 

The connection between the position-space wave function (x|i/r) and the 
momentum-space wave function (p\\//) is through the set of amplitudes 

{x\p) = —L=e ipx/n (6.173) 

V2jr h 

These amplitudes can be used to transform back and forth between position and 
momentum space: 


(p\f) = J 

f dx (p\x}{x\it} = j 

f dx X~e~ ipxlH (xW) 

(6.174a) 

(xW) = j 

f dp (x\p)(p\4) = J 

f dp -jL= e^Hpm 

V2rth 

(6.174b) 


Thus the position-space and momentum-space wave functions form a Fourier trans¬ 
form pair. 


Problems 


6 . 1 . 

(a) Use induction to show that [x tt , p x J = //mi"" 1 . Suggestion: Take advantage 
of the commutation relation [AZJ, C] = A[B, C J + [A, C\B in working out 
the commutators. 
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(b) Using the expansion 


F(x) = F( 0) + 


dF 


1 / d 2 F 

. x + ~ ~—- 
dx / x= o 2! \ dx~ 


+ • 


1 


x=0 

d n F 


n\ \dx"/ x=0 


x n + ■ 


show that 

9 F A 

r Fix), P x ] = th—(x:) 

ax 


(c) For the one-dimensional Hamiltonian 


ft = ^r + TO 

2 m 


show that 


6.2. Show that 


and 


<HPx 

dt 


dV 

dx 


0 

(p\x\f) =ih—{p\f) 

Bp 


-I 


9 


(tp\x\f} = / dp {p\y)*ih—{p\f) 

dp 

What do these results suggest for how you should represent the position operator 
momentum space? 

6.3. Show for the infinitesimal translation 

m->\t’) = n8xM) 


that 


{x) (x) + Sx and (pj (p x ). 


6.4. 

(a) Show for a free particle of mass m initially in the state 


fix) = (x\f) = 


1 



e ~x 2 Ha 


2 


that 


fix, t) = {x\f(t')} = 


___ e ~ x 2 /{2a 2 [l-t-(iht/ma 2 )]} 

\a + (, iht/ma )] 
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and therefore 



Suggestion: Start with (6.75) and take advantage of the Gaussian integral 
(D.7), but in momentum space instead of position space. 

(b) What is A p x at time f? Suggestion: Use the momentum-space wave function 
to evaluate A p x . 


6.5. Consider a wave packet defined by 


{ P\f) = 


0 p < -P/2 
N —P/2<p<P/2 
0 p>P/2 


(a) Determine a value for N such that (i/r|i/>) = 1 using the momentum-space 
wave function directly. 

(b) Determine [x \ \j>) = t/r(x). 

(c) Sketch (p\i/r) and {x\\j/). Use reasonable estimates of A p x from the form of 
(p 1 1 If) and Ax from the form of (x|i/r) to estimate the product Ax A p x . Check 
that your result is independent of the value of P. Note: Simply estimate rather 
than actually calculate the uncertainties. 


6 . 6 . 

(a) Show that (p x ) — 0 for a state with a real wave function (x|r//). 

(b) Show that if the wave function (x j ij/) is modified by a position-dependent 
phase 


then 


(x) -x (x) and ( p x ) -> (p x ) + p 0 

6.7. In Example 6.2 it was assumed that ( p x ) = 0. Determine the minimum uncer¬ 
tainty wave function if this constraint is relaxed. 

6.8. Establish that the position operator x is Hermitian (a) by showing that 

(y\x\f) = (ir\x\<p)* 

or (b) by taking the adjoint of the position-momentum commutation relation (6.31). 
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6.9. Without using exact mathematics—that is, using only arguments of curvature, 
symmetry, and semiquantitative estimates of wavelength—sketch the energy eigen¬ 
functions for the ground state, first excited state, and second excited state of a particle 
in the potential energy well V(x) = a\x\ shown in Fig. 6.18. This potential energy 
has been suggested as arising from the force exerted by one quark on another quark. 


6.10. The one-dimensional time-independent Schrodinger equation for the potential 
energy discussed in Problem 6.9 is 


d 2 ij/ 

dx 2 


+ -a\x\)xlr = 0 

n~ 


Define E — s(h 2 a 2 /m) 1 ^ and jc == z(h 2 /ma )^ 3 . 

(a) Show that s and z are dimensionless. 

(b) Show that the Schrodinger equation can be expressed in the form 

»2 I 

+2(e - \z\)f — 0 

dz- 


(c) Numerically integrate this equation for various values of e, beginning with 
difs/dz = 0 at z = 0, to find the value of a corresponding to the ground-state 
eigenfunction. 


6.11. Show in Example 6.6 that the probability that the particle is found in the second 
excited state if a measurement of the energy is carried out is given by 

i / jnwidth 2a i r-width a\ i- r\ 

| <£ 3 \E X }| — 2^2 — 

6.12. The normalized wave function for a free particle is given by 


{x\ir) = 


n. 


v -cos 


Ttx 

a 


0 


I* | < a/2 
|x| > «/2 
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Such a state might be created by putting the particle into the ground state of the 
potential energy box discussed in Section 6.9 and then instantaneously removing 
the potential. What is the probability that a measurement of the momentum yields 
a value between p and p + dpi Your final answer should not involve any complex 
numbers, since the probability of having momentum between p and p + dp is a real 
quantity. Simplify your answer as much as possible. Suggest a strategy for measuring 
this probability. 


6.13. Solve the energy eigenvalue equation in position space for a particle of mass 
m in the potential energy well 


V(x) = 


0 0 < x < L 

oo elsewhere 


Show that the energy eigenvalues are given by 

fc2 2 2 

rln n 


2m L 


n = 1, 2, 3, ... 


with the corresponding normalized energy eigenfunctions 





6.14. Determine Ax, A p x , and AxAp x 
of the potential energy well 


elsewhere 

for a particle of mass m in the ground state 


10 0 < x < L 

l oo elsewhere 


6.15. A particle of mass m in the one-dimensional potential energy well 


V(x) = 



0 < x < L 
elsewhere 


is at time t = 0 in the state 


fix) = 




1 




2 TZX 

L 


0 < x < L 
elsewhere 


(a) What is i//(x, t)7 

(b) What is (E) for this state at time tl 

(c) What is the probability that a measurement of the energy will yield the value 

h 2 Jt 2 /2mL 1 l 

(d) Without detailed computation, give an argument that (x) is time dependent. 
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6.16. A particle of mass m is in lowest energy (ground) state of the infinite potential 
energy well 

f 0 0 < x < L 

V(x) = 

l oo elsewhere 

At time t = 0, the wall located at x = L is suddenly pulled back to a position at 
x — 2L. This change occurs so rapidly that instantaneously the wave function does 
not change. 

(a) Calculate the probability that a measurement of the energy will yield the 
ground-state energy of the new well. What is the probability that a measure¬ 
ment of the energy will yield the first excited energy of the new well? 

(b) Describe the procedure you would use to determine the time development of 
the system. Is the system in a stationary state? 


6.17, A particle in the potential energy well 


is in the state 


V(x) = 


0 0 < x < L 

oo elsewhere 


(a) 

(b) 

(c) 


f(x) 


Nx(x — L) 0 < x < L 
0 elsewhere 

Determine the value of N so that the state is properly normalized. 

What is the probability that a measurement of the energy yields the ground- 
state energy of the well? 

What is (£) for this state? 


6.18. 

(a) What is the magnitude of the ground-state energy for the infinite well if the 
confined particle is an electron and the width of the well is an angstrom, a 
typical size of an atom? 

(b) If the particle is a neutron or a proton and the width of the well is a charac¬ 
teristic size of a nucleus, what is the magnitude of the ground-state energy? 


6.19. An interesting limiting case of the finite square well discussed in Section 6.8 
is the case where the well depth approaches infinity but the width of the well goes to 
zero such that V 0 a remains constant. Such a well may be represented by the potential 
energy satisfying 


2m 

1c- 


V(x) = -yS(x) 
b 


where 8(x ) is the Dirac delta function. Note that a/7? is a constant having the units 
of inverse length and we have taken the top of the well to be at V = 0. 
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(a) Show by integrating the time-independent Schrodinger equation that the 
derivative of the energy eigenfunction is not continuous at the origin, but 
satisfies 





(b) Determine the energy eigenvalue(s) for this well. Sketch the energy eigen- 
function(s). 

Suggestion: Solve the Schrodinger equation for E < 0 in the regions x < 0 
and x > 0, where V' = 0, and join the solutions together, making sure that the 
boundary conditions on continuity of the wave function and discontinuity of 
the derivative of the wave function at x = 0 are satisfied. 


6.20. Normalize the wave function 


(*W) = 


Ne~ KX x > 0 
Ne KX x < 0 


Determine the probability that a measurement of the momentum p finds the momen¬ 
tum between p and p + dp for this wave function. Note: This wave function is the 
energy eigenfunction for the delta function potential energy well of Problem 6.19. 


6.21. Calculate the reflection and transmission coefficients for scattering from the 
potential energy barrier 


jy-T(x) = ~5(x) 
fi l b 

Note the discussion in Problem 6.19 on the boundary conditions. 

6.22. Show that the reflection and transmission coefficients for scattering from the 
step potential shown in Fig. 6.14 are'given by (6.135) even when the particles are 
incident on the step from the right instead of from the left. 

6.23. Derive the expression (6.144) for the transmission coefficient for tunneling 
through a square barrier. 


6.24. 

(a) Show that the transmission coefficient for scattering from the potential energy 
well 


V(x) = 


0 x <0 

— V'o 0 < x < a 

0 x > a 
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is given by 


T = 


sin 2 + V 0 )a 

/] E (E+ V 0 ) 

Vo V 0 


Suggestion: What is the transcription required to change the wave function 
(6.142) into the one appropriate for this problem? What happens to the trans¬ 
mission coefficient (6.144) under this transcription? 

(b) Show that for certain incident energies there is 100 percent transmission. 
Suppose that we model an atom as a one-dimensional square well with a 
width of 1 A and that an electron with 0.7 eV of kinetic energy encounters 
the well. What must the depth of the well be for 100 percent transmission? 
This absence of scattering is observed when the target atoms are composed 
of noble gases such as krypton. 


6.25. Electrons in a metal are bound by a potential that may be approximated by a 
finite square well. Electrons fill up the energy levels of this well up to an energy called 
the Fermi energy, as indicated in Fig. 6.19a. The difference between the Fermi energy 
and the top of the well is the work function W of the metal. Photons with energies 
exceeding the work function can eject electrons from the metal—the photoelectric 
effect. Another way to pull out electrons is through application of an external uniform 
electric field E, which alters the potential energy as shown in Fig. 6.19b. Show that 
the transmission coefficient for electrons at the Fermi energy is given by 


T ~exp 


/ —\\plm W 3 / 2 \ 
\ 3e\E\h / 


How would you expect the field-emission current to vary with the applied voltage? 



(a) 



Figure 6.19 (a) A finite square well is an approximation to the potential well 
confining electrons within a metal, (b) Applying a negative voltage to the metal 
alters the potential well, permitting electrons to tunnel out. 
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The One-Dimensional Harmonic 
Oscillator 


In this chapter we turn our attention to a system in which a particle experiences a 
potential energy V (x ) that varies with position in a nontrivial way—namely, the 
simple harmonic oscillator. Not only is this a system for which we can determine 
exactly the energy eigenvalues and eigenstates in a number of different ways, but it 
is also a system with an extremely broad physical significance. 


7.1 The Importance of the Harmonic Oscillator 


What gives the harmonic oscillator such a broad significance? First let’s consider a 
specific example familiar from classical mechanics. A mass m attached to a string of 
length L is free to pivot under the influence of gravity about the point O, as shown 
in Fig. 7.1. The energy of the system c^n be expressed as 


It 1 , 

£ = -m» + mgh — —mV 


+ mgH 1 — cos 9) 


(7.1) 


If the angle 9 is small, we can expand cos 9 in a Taylor series and retain the leading 
terms to obtain 


E = —mi i 2 + -mg 1.9' = -mv 2 + — TA x 2 (7.2) 

2 2 2 2 L 

where we called the arc length LB = x. Thus, provided the oscillations are small, 
the system behaves like a harmonic oscillator with a spring constant k = tng/L and, 
therefore, a spring frequency on — s/k fm — y/g/L. Notice that there is no physical 
spring actually attached to the mass in this case. 


245 
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Figure 7.2 An arbitrary potential energy V (x) 
with a minimum at x — x 0 . 


Let’s now examine any potential energy function V(x) that has a minimum at 
a position that we call x 0 , as shown in Fig. 7.2. Expanding V(x) in a Taylor series 
about the minimum, we obtain 

V(x)=V(x Q )+(^pj (x-x 0)+ if^) (x - x 0 ) 2 + • ■ • (7.3) 

\dx/ x=XQ 2 \\dx-J x=X(} 

Since x n is the location of the minimum of the potential energy, the lirst derivative 
vanishes there and 

V(x) = V (x 0 ) + hc{x - x 0 ) 2 3- (7.4) 

where k = (cl 2 V/dx 2 ) x=XQ is a positive constant. Since it is only differences in 
potential energy that matter physically, we can choose the zero of potential energy 
such that V (x 0 ) = 0. If we now position the origin of our coordinates at ,v 0 , then 

1 7 

V (x) = -kx" + ■ • • (7.5) 
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Thus, provided the system is undergoing sufficiently small oscillations about the 
equilibrium point, we can neglect the higher order terms in the Taylor series ex¬ 
pansion, and the effective potential energy is that of a harmonic oscillator. Good 
examples of systems that behave like harmonic oscillators on a microscopic scale 
are the vibrations of nuclei within diatomic molecules and the vibrations of atoms 
in a crystalline solid about their equilibrium positions. 

In Section 7.2 we will solve the harmonic oscillator using operator methods 
reminiscent of those that wc used in Chapter 3 to determine the eigenstates of angular 
momentum. In the example of angular momentum, we utilized only the commutation 
relations (3.14), without having to specify the type of angular momentum. This 
approach generated solutions that included intrinsic spin in addition to the more 
familiar orbital angular momentum, which we will analyze in Chapter 9. Here too, 
solving the harmonic oscillator with operator methods allows for more abstract 
solutions in which the variable x may not be the usual position at all. In Chapter 14 
we will see that the Hamiltonian of the electromagnetic field may be expressed as a 
collection of such abstract harmonic oscillators. Planck’s resolution of the ultraviolet 
catastrophe in the analysis of the blackbody spectrum, which amounted to treating 
these oscillators as quantum oscillators, can be considered as the starting point of 
quantum field theory. 1 

7.2 Operator Methods 


Our goal is to determine the eigenkets and eigenvalues of the Hamiltonian 

H — -^4- + -ma?x 2 (7.6) 

2m 2 

where we have expressed the kinetic energy of the particle in terms of its momentum, 
as in Chapter 6, and set the spring constant k equal to mar. In addition to the 
expression for H, the only other ingredient required for a solution using operator 
methods is the commutation relation 

[x,p x \=ih (7.7) 

Notice that the Hamiltonian is quadratic in both the position x and the momentum 
p x , just as the operator J 2 is quadratic in the individual components of the angular 


1 Perhaps it is not so surprising that P. A. M. Dirac, who invented the elegant operator approach 
to the harmonic oscillator in 1925, went on to apply these same techniques to the quantization of 
the electromagnetic field as early as 1927, while the details of nonreiativistic quantum mechanics 
were still being worked out. At least it is not surprising if you happen to be as clever as Dirac, 
who in 1928 also developed the relativistic wave equation for spin-1 particles, the famous Dirac 
equation. Not bad for three years’ work. 
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momentum. For the harmonic oscillator we introduce two non-Hermitian operators 


and 




(7.8) 


(7.9) 


in a fashion similar to the way we introduced J± = J x ± i Since x and p x have 
different dimensions, we cannot just add x ± i p x , as we did for angular momentum. 
The factor of y/nuoflh is inserted in front so that the operators (7.8) and (7.9) are 
dimensionless. Using (7.7), we can verify that these operators satisfy the simple 
commutation relation 


[a, « f ]= 1 


(7.10) 


Inverting (7.8) and (7.9), we obtain 


x = 


h ^ 

(a + a ) 


2m co 


and 


A . jmcoh „ „ +x 

Px = ~ l y~2~ (a ~ a ’ 

which can be used to express the Hamiltonian (7.6) as 


hoo 


H = — (a'a + aa) = hoo I a'a -|— 


(7.11) 


(7.12) 


(7.13) 


In the last step we have taken advantage of the commutation relation (7.10). Thus 
finding the eigenstates of H is equivalent to finding the eigenstates of 


N = a'a 


(7.14) 


often called the number operator, for reasons that will be apparent shortly. 

Let’s temporarily denote the eigenstates of N by \rj): 

N}*l) — Q\n) (7.15) 

The expectation value of the number operator in an eigenstate is given by 

(n\N\ri) = {n\a^a\r)) = (7.16) 


Calling 


a\n) = \ f) 


(7.17) 
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we can express equation (7.16) as 


{f\t) = rj(r)\rj) (7.18) 

Since (ir\ir) > 0 and {rj\r}) > 0, (7.18) shows that the eigenvalue rj > 0. 

It is the commutation relations 

[N, a] = [fi t B, a] 

= [d a] a 

= -a (7.19) 

and 

[N, o t J = [a* a, a' [ 

= a\a, d^] 

= d f (7.20) 

which follow from (7.10), that make the operators a and a ' so useful. Compare with 
the similar relations (3.39) for angular momentum. To see the action of a'- on the ket 
| rj), we evaluate Na ' \r}). In order to let the operator N act on its eigenstate, we use 
the commutation relation (7.20) to switch the order of the operators, picking up an 
extra term because the operators do not commute: 

iVa + |i?> = (a f N +d t )|r?> 

= (a f i]+a J )\r]) 

— (rj + l)a^\rj) (7.21a) 

We can make the action of the operator a r more transparent with the addition of 
some parentheses to this equation: 

^(d t |/7}) = (7?+l)(d t b>)) (7.21b) 

indicating that 

= c + |»7 + 1) (7-22) 

that is, a r |?/} is an eigenket of N with eigenvalue rj+ 1. Thusd 1 ^ is a raising operator. 
Similarly, 

Na\rj) = (aN — a)\rj) 

= (17-1)519) (7.23) 

so a is a lowering operator: 

0|9)=*_|i7-l) (7.24) 


Page 265 (metric system) 



250 | 7. The One-Dimensional Harmonic Oscillator 


Unlike the case of angular momentum, where there are limits on both how far we 
can raise and how far we could lower the eigenvalues of./,, the only limitation here 
comes from the requirement that rj > 0. Thus there must exist a lowest eigenvalue, 
which we call r/ min . The ket with this eigenvalue must satisfy 

a\h mm )=0 (7.25) 

for otherwise a\ p min ) = c[p min — 1), violating our assumption that p min is the lowest 
eigenvalue. However, if we apply the raising operator to (7.25), we obtain 

® a I*7min) ^minl^min) ~~ 0 (7.26) 

where the middle step follows from the fact that |r/ min ) is an eigenstate of N = a f a. 
Since the ket |p min ) exists, (7.26) requires that rj mm — 0. Thus we label the lowest 
state simply as |0). Applying the raising operator n times, where n must clearly be 
an integer, generates the state | n) satisfying 

N\n)=n\n) n= 0,1,2,... (7.27) 

Thus the eigenvalues of the number operator are the integers—hence the name for 
this operator. The eigenvalues of the Hamiltonian are determined by 

H\n) = Hco(^N + ^j \n) = ha)(n + ^j \n) = E„\n) n= 0,1,2,... (7.28) 

The energy of the harmonic oscillator is thus quantized, taking on only discrete 
values. This characteristic energy spectrum of the harmonic oscillator is shown in 
Fig. 7.3. Notice that there is a uniform spacing between levels. 


V 



Figure 7.3 The energy spectrum of the harmonic oscillator su¬ 
perimposed on the potential energy function V (x) = mco 2 x 2 /2. 
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Figure 7.4 The planar structure of the ethy¬ 
lene molecule, C 2 H 4 . 


EXAMPLE 7.1 A molecular system that exhibits the energy spectrum of 
the harmonic oscillator with an interesting twist is the torsional oscillation 
of the ethylene molecule, C 2 H 4 . In the ground state, the six atoms of this 
molecule all lie in a plane, as shown in Fig. 7.4. The angle between each of 
the adjacent C-H and C-C bonds is roughly 120°. It is possible to rotate one 
of the CH 2 groups with respect to the other by an angle 0 about the C-C axis, 
as shown in Fig. 7.5a. If we rotate by an angle 0 = :r, the molecule returns to 
a configuration that is indistinguishable from the 0 = 0 configuration. Thus 
0 = 0 and 0 = rr must be minima of the potential energy of orientation. 
These two minima are separated by a potential barrier, as shown in Fig. 7.5b. 
A simple approximation for this potential energy function is given by 

V (0) = — (1 — cos 20) 

As discussed in Section 7.1, in the vicinity of each of the minima, the system 
behaves like a harmonic oscillator. How is the energy spectrum modified 
from that given in Fig. 7.3 by the fact that the CH 2 group can tunnel between 
these two minima? 

SOLUTION In our discussion of the ammonia molecule in Section 4.5, we 
saw that in the absence of tunneling there would be a two-fold degenerate 
ground state with energy £ 0 , arising from the fact that the energy of the 
N atom above the plane formed by the three hydrogen atoms is equal to the 
energy of the N atom below the plane. However, when tunneling is taken into 
account, this energy level splits into two different energies, one with energy 
Eq — A and one with energy £ 0 + A. In the case of the ethylene molecule, if 
tunneling between the two minima is neglected, the low lying energy levels 
should be that of a harmonic oscillator, as indicated in Fig. 7.6a. However, 
since the potential barrier between these configurations is not infinite, the 
CH 2 group can tunnel between them. As in the ammonia molecule, this 
tunneling causes each of the two-fold degenerate energy levels to split into 
two distinct energy levels, with a small spacing between them proportional 
to the tunneling amplitude. Since both the distance in energy below the top of 
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V 




Figure 7.5 (a) A view along the C-C axis of the C 2 H 4 molecule showing one of the CH 2 
groups rotated relative to the other by angle < p. (b) The potential energy of the molecule as 
a function of <j>. 


E 

- E 2 


E 


E i 


- £() .=====. 

(a) (b) 

Figure 7.6 (a) The three lowest energy levels of the C 2 H 4 molecule, 
neglecting tunneling, (b) The energy spectrum with tunneling taken into 
account. 


the barrier and the width of the barrier decrease as the energy of the system 
increases, the magnitude of this splitting grows [see (6.149)J as the quantum 
number n increases, as sketched in Fig. 7.6b. 


7.3 Matrix Elements of the Raising and Lowering Operators 


It is useful for us to determine the constants c + in (7.22) and c_ in (7.24). For 
example, the bra equation corresponding to the ket equation 


a T |n) = c + \n + 1) 


(7.29a) 
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is given by 

{n\a — c* + {n + 1| (7.29b) 

Taking the inner product of these equations, we obtain 

(n\aa'\n) = (n\(a*a + l)|n) = (n + l)(n|n) 

= c* + c + (n + l\n + 1) (7.30) 

If the eigenstates are normalized, that is, they satisfy (n\n) = 1 for all n, we can 
choose c + = + 1, or 

a t |n) = Vn + 1 \n + 1) (7.31) 

Similarly, we can establish that 

a\n) = sfn \n — 1) (7.32) 

Thus the matrix elements of the raising and lowering operators are given by 

(n'\a r \n) = s/n + 1 S n , n+1 (7.33) 

and 

(ri\a\n) = (7.34) 

The matrix representations of the raising and lowering operators using the energy 
eigenstates as a basis are then given by the infinite-dimensional matrices 

/ 0 0 0 ...\ 

Vi o o 

a f 0 V2 0 (7.35) 

0 f£) 

o vT o o 

0 0 y/2 0 

0 0 0 x/3 

It is straightforward to construct the matrix representations of the position and the 
momentum operators using (7.35) and (7.36). 

We can also establish (see Example 7.2) that a normalized ket \n) can be ex¬ 
pressed as 

(aV 

|n) = i 7 =-|0> (7.37) 

v«! 
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Finally, notice that increasing the energy of the harmonic oscillator from E n to 
E n+ 1 requires the addition of energy ha> to the oscillator. In Chapter 14 we will 
see that the electromagnetic field is composed of abstract harmonic oscillators. In 
that case the natural interpretation is that the state with energy E n is composed of n 
photons and the state with energy E n+l is composed of n + 1 photons. The additional 
energy hto is exactly the quantum of energy that we expect for a photon with angular 
frequency to. For photons, instead of referring to a T as a raising operator, we will call 
it acreation operator. Similarly, the operator a will be referred to as an annihilation 
operator, since when a acts on a state \n). it decreases the number of quanta in the 
state from n to n — 1. 

■ . ■ ■ ' > ■ . ■■ ■ , ■ 

EXAMPLE 7.2 Verify that the states 

|l)=a f |0) and |2) = !—C|0) 

V2! 

are properly normalized. 

SOLUTION We assume that (0|0) = 1. Then 

(i|i) = (oiaa t |o> = (OK^a + i)io) = <oko + i>io> = <oio> = 1 

where we have taken advantage of the commutation relation [a, a t ] = 1. 

Similarly, 

(2|2) = = ^\(^a + 1)|1) = |(1|(1-+- 1)|1) = (1|1) - 1 

The extension of these results to the general case can be established by 

induction. See Problem 7.3. 


7.4 Position-Space Wave Functions 


It might appear that we are far removed from the wave functions of wave mechanics, 
but in fact we can obtain the position-space (and momentum-space) energy eigen¬ 
functions for the harmonic oscillator easily from the results that we have obtained 
so far. We start with the ground state. The ground-state ket satisfies 

a|0} = 0 (7.38) 

Projecting this equation into position space, we obtain 

(x\a |0) = , /“ {x | (x + — p ) |0) = 0 (7.39) 

V 2ft. \ mco J 
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Recall from (6.42) that 


(x\p x \0) = 


hd{x\ 0) 
i dx 


(7.40) 


where (jc] 0> is the amplitude to find a particle in the ground state with position x. Also 

{jc|*|0)=Jt(jc|0) (7.41) 

where we have allowed the Hermitian operator x to act to the left on its eigenbra. 
Thus (7.39) reduces to the first-order differential equation 2 

d(x |0) m cox 


dx 


h 


-(-v|0> 


which is easily solved: 

( jc | 0 ) = Ne~ mcox2l2h 

Normalizing [see (6.61)], we obtain 

(x | 0 ) = (—) ' e- mo>x2/2h 

\ nh ) 


(7.42) 


(7.43) 


(7.44) 


Once we have determined the ground-state wave function, we can take advantage 
of (7.37) to determine all of the position-space energy eigenfunctions: 

(x\n) = -^(jcKaVIO) 


ma> 

~2H 


x — 


h_d_Y 

mto dx 


ma) 

nh 


1/4 


-m(ox z f2h 


For example. 


<*|1> = 


4 / mm \' 

n \T) 


1/4 


xe 


-mcox^/2h 


(7.45) 


(7.46) 


(x\2) = ( —V /4 (2*™x 2 - l ) e- m ^ 2h {1A1) 

\ 4tt h} \ h } 

The energy eigenfunctions and the corresponding position-space probability den¬ 
sities | (jc | /i) | 2 for these states, as well as those with n = 3, 4. and 5, are plotted in 
Fig. 7.7. 

These eigenfunctions exhibit a number of properties worthy of note. The number 
of nodes, or zeros, of {x\n) is n. The increasingly oscillatory character of the 


2 Since {jc]0} is a function of x only, we can replace the partial derivative with an ordinary 
derivative. 
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Figure 7.7 The wave functions (x\n) and the probability densities |(x|n)| 2 plotted for 
the first six energy eigenstates of the harmonic oscillator. The classical turning points at 
x„ = yf(2n + are determined from (7.59). 
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functions as n increases reflects the increasing kinetic energy of each of these states. 
The expectation value of the kinetic energy is given by 



HL 

2m 



d 2 

dx {n\x )— r(x| n) 
dx 2 


(7.48) 


As the number of nodes of (x|n) increases, the curvature of the eigenfunction 
increases and hence the second derivative in (7.48) takes on larger values. The 
expectation value of the potential energy 

1 f°° 1 

(V(x)) = -mu) 2 / dx (n\x)x 2 {x\n) — -mar 1 dx x 2 |(x|n)j 2 (7.49) 

2 J—OO 2 J— 00 

also increases with increasing n as the region over which the eigenfunction is 
appreciable increases. 


7.5 The Zero-Point Energy 


One of the most striking features of the harmonic oscillator is the existence of 
a nonzero ground-state energy E 0 = ficu/2, known as the zero-point energy. In 
classical mechanics the lowest energy state occurs when the particle is at rest (p x = 0 
and hence zero kinetic energy) with the spring unstretched (x = 0 and hence zero 
potential energy ). In the real world this configuration is forbidden by the Heisenberg 
uncertainty relation, which enters into the solution of the harmonic oscillator through 
the commutation relation (7.7). The particle in the ground state, and in fact in any 
of the eigenstates, has a nonzero position uncertainty Ax [(Ax) 2 = (x 2 ) - (x) 2 ] 
as well as a nonzero momentum uncertainty A p x [(Ap A .) 2 = {p 2 ) — (p x ) 2 ]. It is 
straightforward to see how these uncertainties affect the value of the ground-state 
energy. For any state 


( P 2 ) 1 - ,, 

(E) = ^ + -m«V) 
2m 2 


(A ' , «T+ (ft*' + + 

2m 2 


(7.50) 


There are a number of ways to establish that (x) and (p x ) both vanish in an energy 
eigenstate of the harmonic oscillator. One way is through explicit evaluation: 


(n|x|n) = v /-2— (n\(a+d T )\n) 
Zmo) 


ft 


V 2m co 


{s/n {n\n — 1) + s/n + 1 (n\n + 1)) = 0 (7.51) 


/ j- 

(n\Px\ fl ) = - a r )\n) 


— (Vn (n\n — 1) — s/n + 1 (n\n + 1)) = 0 (7.52) 
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Thus in an energy eigenstate 


(E) = + -mco 2 ( Ax) 2 (7.53) 

2m 2 

How does nature keep the ground-state energy as small as possible? Clearly, 
localizing the particle at the origin and minimizing the potential energy will not work 
because as Ax 0, A p x —*■ oo in order to satisfy Ax A p x > h/2. Similarly, trying 
to put the particle in a state with zero momentum to minimize the kinetic energy 
implies A p x —> 0, which forces Ax -» oo. Thus nature must choose a tradeoff in 
which the particle has both nonzero Ax and A p x and, therefore, nonzero energy. 
Explicitly, for the ground state 


and 


(Ax) 2 = —(0|(a + a f ) 2 |0) 

2m co 

j- 

= <0|[a 2 + (a') 2 + atf + a f a]|0> 

2m co 

= — (0|(5zi + 10) = ——— (111) = r~~~ (7.54) 

2m co 2m co 2mco 


(A/a) 2 


moo H 
2 

mcoh 


(0|(a — a f ) 2 |0> 

<0|L5 2 + (5 t ) 2 -a« t -« t a]|0) 


mcofi 

o 


(oiaa t |o> 


mwh /ulx mcoh 
2 ~ 2 


(7.55) 


Notice that Ax A p x = h/2 for the ground state. That the ground state is a minimum 
uncertainty state was already apparent from the Gaussian form of the ground-state 
wave function (7.43), given the discussion in Section 6.6. For the excited states, we 
can establish in a similar fashion that 


and 



(7.56) 

(7.57) 


(7.58) 


A good illustration of the effects of this zero-point energy is the unusual behavior 
of helium. Helium is the only substance that does not solidify at sufficiently low 
temperatures at atmospheric pressure. Rather, it is necessary to apply a pressure of 
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at least 25 atmospheres. For substances other than helium, the uncertainty in the 
position of the nuclei in the ground state is in general quite small compared to the 
spacing between the nuclei, which is why these substances solidify at atmospheric 
pressure at sufficiently low temperature. In fact, increasing the temperature populates 
the higher vibrational states and increases the uncertainty, as (7.56) indicates. These 
substances melt when the uncertainty becomes comparable to the spacing between 
the nuclei in the solid. For helium, even in the ground state, the uncertainty is large 
because of two factors: the small mass of helium and the small value of to (because 
of weak attraction between the helium atoms in the solid). Thus Ax is too large for 
helium to solidify at low temperature at a pressure of one atmosphere. Increasing 
the pressure reduces the separation between the helium atoms, thereby increasing to 
and reducing Ax, so that at high pressure helium solidifies. 


7.6 The Large-n Limit 


The existence of a zero-point energy and the discrete energy spectrum (7.28) of 
the harmonic oscillator are purely quantum phenomena. Why don’t we notice this 
discreteness in a macroscopic oscillator such as the pendulum of Section 7.1? The 
answer resides in the smallness of Planck’s constant on a macroscopic scale. For 
example, the angular frequency of the pendulum is co — Jg/L. Thus if L — 10 cm, 
a) is about 10 radian/s and the spacing hco between energy levels is 10~ 26 ergs. If 
the energy E of the pendulum is a typical macroscopic value, such as 1 erg, then 
hco IE = 10“ 26 and the system appears to have a continuous energy spectrum, which 
is what we would expect classically. Note in this case the quantum numbers = 10 26 . 
This suggests that the classical limit is indeed reached in the large-/; limit. 

The classical motion of a particle in a state of definite energy E n is restricted to 
lie within the classical turning points, which are determined by the condition that at 
these points all the energy is potential energy, with zero kinetic energy: 

E n — fn + ^ J hco = ^mftTx 2 (7.59) 


as shown in Fig. 7.8. Examination of the energy eigenfunctions in Fig. 7.7 shows 
that the eigenfunctions extend beyond these classical turning points, but that these 
excursions become less pronounced as n increases. Here again, we see the classical 
limit being reached in the limit of large n. The chance of finding a particle between 
x and x + dx for a classical oscillator with energy E n is proportional to the time 
dx/v that it spends in the interval dx, where v is the speed of the particle. Taking 
advantage of (7.59), we may express this classical probability as 


P d dx oc 


dx 


dx 




(7.60) 


Page 275 (metric system) 



260 | 7. The One-Dimensional Harmonic Oscillator 



Requiring that the total probability of finding the particle between +x„ and —x n is 
unity determines the normalization: 


Pci dx = 



(7.61) 


The probability density |{x|«)| 2 for large «, as well as P cl , is plotted in Fig. 7.9. As 
n increases, the number of nodes of the wave function increases. For sufficiently 
large n, the quantum state is oscillating so rapidly on a macroscopic scale that 
only its mean value can be detected by any set of position measurements. In this 
case, the agreement between the predictions of quantum mechanics and classical 
mechanics is excellent. You can guess how good the agreement is for the pendulum 
example with n = 10 26 . This is a nice example of the correspondence principle. 


K . ri «>| 2 



Figure 7.9 A plot of the probability density | (x\n )| 2 for large n. The 
dashed line is a plot of the classical probability density from (7.61). 


Page 276 (metric system) 





7.7 Time Dependence | 261 


first enunciated by Niels Bohr, that the predictions of quantum mechanics should 
agree with those of classical physics in domains where classical physics works. 

7.7 Time Dependence 


A harmonic oscillator in an energy eigenstate is in a stationary state. Thus it will not 
exhibit the characteristic oscillatory behavior of a classical oscillator. Time depen¬ 
dence for the hannonic oscillator results from the system being in a superposition 
of energy eigenstates with different energies. If we assume the initial state is a su¬ 
perposition of two adjacent energy states, 

|i/r(0)} = c n \n) + c n+ i\n + 1) (7.62) 


then 

- e“' ( ” +V2) “ f (<?» + c n+] e~ im \n + 1>) (7.63) 

In particular, we can take advantage of the expression (7.11) for the position operator 
in terms of the raising and lowering operators to evaluate the expectation value of 
the position of the particle to show that 

(x) = A cos(cot + 8) (7.64) 

This genera] case is examined in Problem 7.9. In Example 7.3 we restrict our 
attention to a superposition of the ground state and the first excited state. 


EXAMPLE 7.3 Take 

!VA0)} = 4=i°} + -7=|1> 

V2 V2 

a 50-50 superposition of the ground state and first excited state. Show that 

(x) — cos cot 

SOLUTION 




e ~ioit ji 

~~7T 


m + 


~3icot/2 


V2 


■ID 


:|0} + 


72 v/2 


( ID 
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Figure 7.10 (a) The probability density | ({x|0) + e la> '(x|l))/-%/2! 2 at t — 0, (b) a quarter 
period later at t = jt/2oj, and (c) at t = ji/co, a half period later. 


Then 

(x) = (x{f\x\xfr) 

= (< 01+e '“ <11 ) ( a + 3 *) + 

= ((01 +f'"(l|) (|1) + e-'"|0) + e-'"72 |2)) 

h /e iwt + e" iwt 
2mo) \ 2 

If we call A = y/h/(2mu)), then 

| (x) = A cos cot 

Thus, as for the classical oscillator, the period of the oscillation is T = 2n /co. 

Figure 7.10 shows the corresponding probability density at the times 
t = 0, t = r/4, and t — 7/2. Although the particle oscillates back and forth 
in the potential energy well, the position of the particle is not very well 
localized. In the next section we will examine a very special superposition of 
energy eigenstates for the harmonic oscillator for which the wave function is 
a pure Gaussian, like the ground state, but unlike the stationary ground state, 
the position varies harmonically with time. 



7.8 Coherent States 


There is a superposition of energy eigenstates of the harmonic oscillator that is 
termed a coherent state. Coherent states are eigenstates of the lowering operator 
a, namely 


a\a ) = of | or) 


(7.65) 
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Since a is not a Hermitian operator, the eigenvalue a need not he real. We will 
see in Chapter 14, in our discussion of quantization of the electromagnetic field 
(where the lowering operator becomes the annihilation operator for a photon), that 
coherent states come closest to representing classical electromagnetic waves with a 
well-defined phase. And one can make the case that for the mechanical harmonic 
oscillator a coherent state comes closest to the classical limit of a particle oscillating 
back and forth in a harmonic oscillator potential. The coherent state was first derived 
in 1926 by Schrodingerin his efforts to find solutions to (what else?) the Schrodinger 
equation that satisfy the correspondence principle. 

We start by writing the state |a) as a superposition of the states | n): 

OO 

l a > == ]C c » (7.66) 

n= 0 

Taking advantage of the fact that a\n) = Jn\n- 1>, (7.65) becomes 

00 OO 

Vn C n \n - 1) = a ^ c„\n) (7.67) 

«=• n=0 

Since 

°° OO 

\) = Y J '/*T\c n , +l \n') (7.68) 

"=• n '=0 

where n' — n — 1, (7.67) can be written as 

00 OO 

T', + lc„+il«) =a ^ c n \n) 

«=0 n=0 

where we have renamed n' as n. Equation (7.69) requires that 

_ 

V« +Tc „ +1 = a c n 

Let s look at the first few terms to see the pattern. Since 

c i=" c o (7.71) 

and 

V 2 c 2 =a c l = a 2 c 0 (7.72) 

therefore 

2 

C2 = 7T° (7.73) 


(7.69) 


(7.70) 


Page 279 (metric system) 



264 | 7. The One-Dimensional Harmonic Oscillator 


And since 


therefore 


In general 


Consequently 


V3 


c 3 = ac 2 




a 


V3! 



a 


:f oE 

n —0 


a 


I n) 


n ! 


(7.74) 


(7.75) 


(7.76) 


(7.77) 


The constant c 0 is determined by normalization. The bra (a| corresponding to the 
ket la) is given by 


Consequently, 


(a | 


E 


(»•>’ 

\fn\ 


(n I 


(7.78) 


(a|a> = |c 0 r 


E 

. n '=0 


(a*)' 1 


<«'l 


' W ! 


E 

.B=0 


a 


l«> 


n: 


I c 'o! 2 


C° , ,2» 

ar 


n=0 


/i! 


|c 0 | 2 e |a|2 (7.79) 


Requiring that (a|a) = 1 means that |c 0 | 2 = e or c 0 = I^l 2 / 2 U p t 0 an overall 

phase factor. So finally 


|a) = e^fi ^ 

n —0 



(7.80) 


TIME EVOLUTION OF A COHERENT STATE 

Our goal here is to determine how the coherent state evolves in time. We will show 
not only that the coherent state is the minimum uncertainty state—consequently, a 
Gaussian in position space—but that, amazingly, it maintains this shape as the state 
oscillates back and forth in the potential energy well. This is in marked contrast to 
the oscillatory behavior of the superposition of the ground state and the first excited 
state that we examined in Example 7.3. (See Fig. 7.10.) 

We start by applying the time development operator to the ket |a): 

\a(t)) =e~ iH,/n \a) (7.81) 
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Thus 


a 


t _°°. rn n l(« + l/2)o>r 


n=0 


nl 


= f io *P e -\ a rt* 


E 

n—0 

oo 


V«~! 


I«> 




E 

n= 0 


V«~! 




(7.82) 


Therefore, apart from the overall phase factor e ~ iwt / 2 , the eigenvalue a of the 
lowering operator becomes ae~ la>! as time progresses. Thus even if the eigenvalue 
is real at / = 0, it becomes complex as the state evolves in time. 

To show that the coherent state is the minimum uncertainty state, we need to 
evaluate Ax and A p x . We start by determining (x) and (p x ). Just as we did in 
Section 7.5, we can express the position and momentum operators in terms of a and 
ci to make the calculations especially straightforward. First note that the eigenbra 
equation corresponding to the eigenket equation (7.65) is 


= {aja* 


(7.83) 


Therefore 


(x) = (a(?)|x|a(r)) 


h 


2m co 


(a(t)\(a +a f )|a(0) 


2mco 


[a(0 +a*(r)] 


V 2 m w ^ 


ae~ iu)t + a*e iM \ 


(7.84) 


If we express a in the fonn 


a = \a\e~ iS (7.85) 

w'here |a| is the magnitude of the (in general) complex number cr and 3 gives it phase, 

{x) = I 2\a\cos(wt + S) (7.86) 

V 2m co 
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Similarly, 


( Px) = ( a{t)\p x \a(t)) 

_ _ a + )| a (/)) 

= ~'\J ~“ [“(0 ~ «*(')] 

= -iJ^ (ote~ la>t — a*e ia>t ^ 

Again, expressing this expectation value in terms of |or | and <5, we see that 


(Px) = 


i m co fi 

V 2 


2\a\ sin(a>f + 5) 


(7.88) 


Thus the position and momentum are indeed oscillating back and forth as you 
would expect for motion in a harmonic oscillator potential. Notice how the phase 8 of 
the eigenvalue cc determines the phase of the expectation values (x) and (p x ) in this 
oscillation. You can verify that these expectation values obey Ehrenfest’s theorem. 
See Problem 7.18. And in our derivation of (7.86) and (7.88) you can see how the fact 
that the coherent state is an eigenket of a and an eigenbra of d + makes the calculation 
of these expectation values especially straightforward. 

Now let’s determine the uncertainties Ax and A p x . Note that 



h 

2m co 

H 

2mco 

k 

2 mco 
h 

2m co 


(a(t)|[d 2 + (d f ) 2 + aa t + a 'dl|a(f)) 
{a(t)\\cr + (d 1 ) 2 + 2 d'd + l]|a(0) 
\a(t) 2 + a*(t) 2 + 2|a(f)| 2 + ll 


(7.89) 


and 



mcoh 

2 

mcoh 
2 " 



(a(r)|(d - d f ) 2 |a(r)) 

(a(r)|[d 2 + (d 1 ^ 2 — aa — a'a]\a{t)) 
{a{t)\\a 2 + (d f ) 2 - 2d f d - l]|a(0) 
2|a(0| 2 + 1 — ot{t) 2 — a*(f) 2 l 


(7.90) 
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and therefore 


(Ax) 2 = <x 2 ) - ( x ) 2 


h 


2m co 

h 

2 ma> 
h 

2mco 


a(ty + a*(t) 2 + |a(0| 2 + I - [a(f) + a*(0] 2 


j^a(/) 2 + a*(t) 2 + 2|a(r)| 2 + 1 - a(t) 2 - a*(t) 2 - 2|a(r)| 2 


(7.91) 


and 


(A Px ) 2 = (P 2 X ) - (. Px ) 2 


X' 

mojh 

2 

mcoh r 


[2|a(r)| 2 + 1 - a(t) 2 - a*(t) 2 + [a(t) - a*(t)f 


^ [ 2 |or(/)| 2 + I - a(t) 2 - a*(t) 2 + a(t) 2 - 2\a(t)\ 2 + a*(r) 2 ] 
mcoh 


(7.92) 


Thus both Ax and A p x are independent of time. Perhaps most strikingly, the product 
of these uncertainties is given by 


AxAp x = 




(7.93) 


Thus a coherent state is a minimum uncertainty state. In Example 6.2 we proved 
that in position space the minimum uncertainty state is a Gaussian. But unlike the 
Gaussian wave packet for the free particle that we analyzed in Section 6.6, where 
Ax increases with time, here the minimum uncertainty state remains the minimum 
uncertainty state. This Gaussian does' not spread with time, as it did for the free 
particle. Thus the wave function for a particle in a coherent state oscillates back and 
forth in the potential energy well without changing its shape—without dispersion, 
as indicated in Fig. 7.11. This is as close as we are going to get to a quantum 
state that might represent, for example, the motion of the classical pendulum bob in 
Section 7.1. 


EXAMPLE 7.4 The ground-state energy eigenfunction of the harmonic 
oscillator 

\ mcox~/2h 

7 th) 
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Figure 7.11 The wave function for a co¬ 
herent state is a Gaussian that moves 
between the turning points without dis¬ 
persion. 


is a Gaussian. Show that we can generate the coherent state |a) by simply 
displacing the ground state from its equilibrium position by a distance d by 
applying the translation operator 

f(d) = e~'P xd / >i = eV" 

to the ground state, that is, 

f (r/)|0) = |a) 

provided we set a = *Jmco/2h d. 

SOLUTION This is a subtle problem. Notice that if we set a — «Jmco/2h d 
at t — 0, it becomes complex as time evolves. With this in mind, it is helpful 
to introduce a generalized translation, or displacement, operator 

D(a) = e (a<5t -“* a) 


which reduces to 

f(d) = e~‘P xd / n = e 'J mw / 2h d(a f -a) 

when a = s /inco/2h d. To establish that 

D(a) |0) = |cf) 

we need to make use of the identity 

e A+B =e A e B e -[lB]/2 

which holds when the operators A and B each commute with the commutator 
[A, B], See Problem 7.19. In our case, we take A = aa^ and B = —a*a. Since 
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the commutator [a, o + J = 1, which clearly commutes with a and a\ we can 
safely apply this operator identity in this case. Therefore 


D(a) 


^—a*a {aa*,a*a\/2 


and consequently 


D(a) |0) 


,-|«l 2 /2 


E 

n =0 


I n) = |a) 


In deriving this result we have made use of the Taylor series expansion 
for the operators e aa and e~ a * a . Thus displacing the ground state from its 
equilibrium position generates a coherent state, a Gaussian wave function 
that oscillates back and forth, as indicated in Fig. 7.11. In terms of the 
displacement distance d, you can verify that (x) = d cos cot. 




7.9 Solving the Schrodinger Equation in Position Space 


There is another technique for determining the energy eigenvalues and the position- 
space eigenfunctions of the harmonic oscillator that we will find particularly useful 
when we solve the three-dimensional Schrodinger equation in Chapter 10. Rather 
than take advantage of the operator techniques of Section 7.2, we solve the energy 
eigenvalue equation 

(x\H\E) ={x\(^ + \ma?x 2 \ \E) = E(x\E) (7.94) 

\ 2m 2 / 


directly in position space, as in Chapter 6. Using the results of that chapter, we can 
express this equation as 

Jr 2 j2 1 

— ——-(x\E) + ~mco 2 x 2 (x\E) = E{x\E) (7.95) 

2mdx 2 2 

The position-space energy eigenvalue equation (7.95) is a nontrivial second- 
order differential equation. To make its structure a little more apparent, it is good to 
introduce the dimensionless variable 


y = 



(7.96) 


where the factor, Jnuo/fi is a factor with the dimensions of inverse length that occurs 
naturally in the problem. We call the wave function 


(x\E) = f(y) 


(7.97) 
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where the energy eigenvalue E is implicit on the right-hand side. Expressed in terms 
of these variables, the differential equation (7.95) becomes 

d 2 \i/ 7 

—y + (£ ~ vM = 0 (7.98) 

dy 

where s = 2E/hco is a dimensionless constant. Although this equation does not look 
especially complicated, it is difficult to extract the physically acceptable solutions. 
A good procedure for resolving this difficulty is to explicitly factor out the behavior 
of the wave function as |y| -> oc. In this limit we can neglect the term involving e, 
and the differential equation becomes 

d 2 \lr T 

~rY-yV = 0 (7.99) 

d y- 

The solution to this equation is 

f = Ae~ y2/2 + Be yl/2 (7.100) 


We immediately discard the exponentially increasing solution as |y| -» oo because 
we are searching for a normalizable state satisfying {xp- \ f) = 1. In fact, in the limit of 
large y, we can take any power of v times the decreasing exponential as an asymptotic 
solution of (7.99): 


—AAy m e- yl t 1 ) = Ay m+2 1 
dy 1 


2m + 1 m(m — 1)1 _a n 
- 1 - e y ‘ 

y l j 4 J 


f m+2 e -y /2 ; 


y 2 (Ay m e~ y2/2 ) 


(7.101) 


With this in mind, we express the wave function in the form 


i/r(y) = h(y)e yl12 


(7.102) 


If we substitute (7.102) into (7.98), we find that h(y) satisfies the differential equation 

~-2y^ + (e-l)h = 0 (7.103) 

dy dy 

It should be stressed that we have not made any approximations in arriving at (7.103). 
We can think of (7.102) as just a definition of the function h. Although this equation 
for h does not look any simpler than equation (7.98) for i jr, we can now obtain a 
power-series solution of the form 

OO 

h(y) = ^2 a k y k (7.104) 

k= 0 
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to this equation. To see this, we substitute (7.104) into (7.103) to obtain 

OO CO CO 

£ k(k - 1 )a k y k ~ 2 - 2 £ ka k y k + (e - 1) £ a k y k = 0 (7.105) 

k =0 k —0 k =0 

Notice that although the first term nominally starts with k = 0, the k(k — 1) factor 
in this summation vanishes when k — 0 and k = 1. Thus this sum really starts with 
k — 2. We can give the summation index any symbol we want without changing the 
intrinsic meaning of the sum. If we let k — 2 — k' in this summation, we obtain 

OO OO 

k(k - 1 )a k y k ~ 2 = + 1 )«t '+2 y k ‘ (7.106) 

k =2 k '=0 


We can rename the summation index k’ as k in (7.106) and substitute this result into 
(7.105), which then becomes 


[(Jk + 2)(it + 1 )a k+2 - 2 ka k + (e - 1 )a k ] y k = 0 (7.107) 

k =0 

where we have now been able to factor out a common factor of y k in each term, which 
was our goal. Since the functions y k are linearly independent, the only way (7.107) 
can be satisfied is for the coefficient of each y k to vanish. Thus we obtain a two-term 
recursion relation: 

^ = . 2 - k + ] Zf (7.108) 

a k (k + 2)(k + 1) 


This recursion relation completely determines the power-series solution given a 0 and 
a\. If we choose a { — 0, the solution will be an even function of y. On the other hand, 
if we choose a 0 — 0, the solution will be an odd function of y? 

In general, (7.108) leads to an infinite power series, which for large k behaves as 


®k+2 2 

- ~ 

k 


(7.109) 


This is the same behavior that the function 

, 00 ,,2n 00 

n=0 ' k=0 

exhibits, since for this function h k = \/(k/2)l and thus 


(7.110) 


^ _ ,1 k = 0,2 ,4 

b k [(ik/2) + 1]! (Jk/2) + 1 t-oo k 


3 If we attempt to find a solution to (7.98) through a power series of the form J2k a ky k ’ we 
obtain a three-temi recursion relation instead of a two-term recursion relation, such as (7.108), 
which is why we switched to solving for h(y) instead of solving for y7(v) directly. 
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In fact, this same asymptotic behavior is exhibited by any power of y times exp(y 2 ). 
Since the large-y behavior of these series is determined by the behavior for large k. 
the series solution for h(y) generates the leading large-y behavior 

rfr -> Ay'V v V-^ 2 = Ay m e yl/2 (7.112) 

Ly|-*oo 

that we tried to discard when we attempted to find a solution of the form (7.102). 

Are there, therefore, no solutions to the differential equation that are exponen¬ 
tially damped and hence satisfy the normalization requirement? The only way to 
evade (7.109) is for the series to terminate. If 

s — 2n + 1 (7.113) 

where n is an integer, then a k+2 = 0 for all k > n. Consequently, ^ is a finite 
polynomial in y multiplied by a decreasing exponential. Since s = 2E/hco, we see 
that 

E n = (n + tiw n =0, 1,2,... (7.114) 

This is the same result that we obtained earlier using operator methods. 

The function h(y) is thus a polynomial of order n, called a Hermite polynomial. 
We can determine the form of these polynomials either from the power-series so¬ 
lution (see Example 7.5) or from our earlier result (7.45). The first three Hermite 
polynomials can be seen in the energy eigenfunctions (7.44), (7.46), and (7.47). 


EXAMPLE 7.5 Use (7.108) to determine the first three Hermite poly¬ 
nomials. 

SOLUTION Substituting £ = In f 1 into (7.108), we obtain 

a k +2 _ 2 (k - n) 

(k T 2)(k 4- 1) 

For n — 0, this equation says that a 2 /a 0 = 0. Thus this series terminates 
after the first term and the first Hermite polynomial is simply a constant. 
In this case we must set a l = 0, for otherwise the series starting with a x does 
not terminate and will behave as e y for large y. For n — 1, the situation 
is reversed. In this case, we must set a 0 = 0 and the series starting with a, 
terminates after the first term. The second Hermite polynomial is simply y. 
Finally, for n — 2, we are back to the first case except that 

— = —2 
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while a 4 and higher terms vanish. Thus the third Hermite polynomial is the 
second order polynomial 1 — 2 y 2 (or 2y 2 — l). As noted already, you can see 
these polynomials multiplying e ~ mcux ~/ 2h i n (7.44), (7.46), and (7.47). 


7.10 Inversion Symmetry and the Parity Operator 


One of the most obvious features of the energy eigenfunctions shown in Fig. 7.7 
is that they are all either even functions satisfying rf/(—x) — f (x ) or odd functions 
satisfying i/r(— x) = —ifr(x). The cause of this behavior is a symmetry in the Hamil¬ 
tonian. We introduce the parity operator n, whose action on the position states is 
given by 


fl|x) = |-jc) (7.115) 

The parity operator inverts states through the origin. An eigenstate of the parity 
operator satisfies 


n\x/, k ) = Wx) 

Since inverting twice is the identity operator, we see that 


(7.116) 


n 2 !^) =a 2 Ml) = i^> 


(7.117) 


or A 2 = 1. Thus the eigenvalues of the parity operator are X — ±1. 

We can evaluate the action of the parity operator on an arbitrary' state \yjr) by 
projecting into position space: 4 




(7.118) 


Thus a parity eigenfunction satisfies 

wniVr) = {-*1 fx) = fk(-x) = Wx(x} (7.119) 

where in the last step we have evaluated the action of the parity operator acting to 
the right on its eigenket. Thus if X = 1, the eigenfunction is an even function of .r, 
and if X = — 1, the eigenfunction is an odd function of x. 


4 The parity operator is Hermitian. See Problem 7.11. 
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It is now easy to see for the harmonic oscillator that the parity operator and the 
Hamiltonian commute. Note that 

( fc 2 j2 \ 

-“^2 + V(-X)J *(-*) 

= — + V{x)) = (x\Hh\f) (7.120) 

\ 2m dx l ) 

provided V(x)= V (— x). Thus for the harmonic oscillator, where V(jc) = ^mafx 2 , 
we deduce that ft H — H fl or 


tn,H] = 0 (7.121) 

This guarantees that the Hamiltonian and the parity operator have eigenstates in 
common, as we have seen. 

The real advantage of this symmetry approach is that by observing the symmetry 
of the Hamiltonian under inversion, we can deduce some of the properties of the 
eigenfunctions—in this case, their evenness or oddness —before rather than after 
we have solved the eigenvalue equation. We will see the utility of this approach 
in Chapter 9, when we consider other symmetries of the Hamiltonian in three 
dimensions. 


7.11 Summary 


The harmonic oscillator deserves a chapter all its own. In addition to the fact that 
an arbitrary potential energy function in the vicinity of its minimum resembles a 
harmonic oscillator (see Section 7.1), the harmonic oscillator is a nontrivial problem 
in one-dimensional wave mechanics with a nice exact solution (see Section 7.9). 
Moreover, the harmonic oscillator will also serve as the foundation of our approach to 
the quantum theory of the electromagnetic field in Chapter 14. One of the underlying 
reasons for such a broad significance of the harmonic oscillator is that we can 
determine the eigenstates and eigenvalues of the Hamiltonian 

» 2 i 

H =+-mco 2 x 2 (7.122) 

2m 2 

in a completely representation-free way. We introduce the lowering and raising 
operators 


a — 




(7.123) 
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and 



*- Px 

mao 


(7.124) 


The position and momentum operators are then written in terms of the raising and 
lowering operators as 


x = 


h 

(a + a ) 


2mco 


and 


. jmcoh „ „ t 


Using (7.125) and (7.126) as well as the commutation relation 

[a, a 1=1 


(7.125) 


(7.126) 


(7.127) 


which follows from the commutation relation [x, p x ] = ih between the position and 
momentum operators, we can express the Hamiltonian in the form 

H = hco^a + ^j (7.128) 

The eigenstates of the Hamiltonian satisfy 

H\n) — [n + - j Kco\n) n= 0,1,2,... (7.129) 

where the state | n) is obtained by letting the raising operator act n times on the lowest 
energy state |0): 

| /;) = _L(« t )"|0) (7-130) 

g/n! 

The action of the raising and lowering operators on the energy eigenstates is given by 

a'ln) = y/n + 1 \n + 1) (7.131) 


and 


a\n) — *Jn\n — 1) (7.132) 

which again follow from the commutation relation (7.127). These raising and low¬ 
ering operators provide a powerful way to evaluate expectation values and matrix 
elements of the position and momentum operators (7.125) and (7.126), without hav¬ 
ing to work directly with wave functions in either position or momentum space. 
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Since each increase or decrease in n by a unit increases or decreases the energy of 
the oscillator by Hco, we can think of the oscillator as containing n quanta of energy 
hco, in addition to the zero-point energy hco/2. The operator a 1 ' creates a quantum of 
energy and can hence be called a creation operator, while the operator a annihilates 
a quantum of energy and is called an annihilation operator. 

The state that most closely resembles the classical motion of a particle confined 
in a harmonic oscillator potential is the coherent state 

°o ,, 

l«) = e~ M2/2 V ~ I n) (7.133) 

H=0 V-« ! 

The coherent state is an eigenstate of the lowering operator: 

a\a}=a\a) (7.134) 

The coherent state, which can be generated by displacing, or translating, the ground 
state |0), is a minimum uncertainty Gaussian wave packet in position space, one that 
oscillates back and forth in the potential energy well without dispersion, maintaining 
AxAp x = h/2. 

Problems 


7.1. Show that the constant c_ = *Jn in (7.24), that is, 

a\n) = */n \n — 1} 

using the procedure we used to establish that c + = + 1, that is, 

a'\n) — \fn + 11 n + 1) 

7.2. Use the matrix representations (7.35) and (7.36) of the raising and lowering 
operators, respectively, to determine the matrix representations of the position and 
the momentum operators using the energy eigenstates as a basis. Verify using these 
matrix representations that the position-momentum commutation relation (7.7) is 
satisfied. 

7.3. Show that properly normalized eigenstates of the harmonic oscillator are given 
by (7.37). Suggestion: Use induction. 

7.4. Use < 310 ) =0 and therefore (p\a [ 0 ) = 0 to solve directly for {// 0), the ground- 
state wave function of the harmonic oscillator in momentum space. Normalize the 
wave function. Hint: Recall the result of Problem 6.2. 

dp 
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7.5. Derive (7.56) and (7.57). 

7.6. Use the Heisenberg uncertainty relation Ax A p x > k/2 to express the expecta¬ 
tion value of the energy (7.53) as an inequality involving the uncertainty in position 
Ax. Show that the Ax that minimizes this expectation value corresponds to a lower 
bound on the energy that is equal to the ground-state energy £ 0 = ha>/2 of the har¬ 
monic oscillator. From this result we can infer that the ground state is the minimum 
uncertainty state. 

7.7. A particle of mass m in the one-dimensional harmonic oscillator is in a state 
for which a measurement of the energy yields the values tico/2 or 3hco/2, each with 
a probability of one-half. The average value of the momentum ( p x ) at time t = 0 
is Jmcoh/2. This information specifies the state of the particle completely. What is 
this state and what is (p x ) at time t? 

7.8. 

(a) Determine the size of the classical turning point x 0 for a harmonic oscillator in 
its ground state with a mass of 1000 kg and a frequency of 1000 Hz. Compare 
your result with the size of a proton. A bar of aluminum of roughly this mass 
and tuned to roughly this frequency (called a Weber bar) has been used in 
attempts to detect gravity waves. 

(b) Suppose that the bar absorbs energy in the form of a graviton and makes a 
transition from a state with energy E n to a state with energy E n+l . Show that 
the change in length of such a bar is given approximately by x 0 (2/7i) 1/2 for 
large n. 

(c) To what n, on the average, is the oscillator excited by thermal energy if the 
bar is cooled to 1 K? 

7.9. Show that in the superposition of adjacent energy states (7.63) the average value 
of the position of the particle is givemby 

(x) = (x/f\x\\jf) = A cos (cot + 8) 
and the average value of the momentum is given by 

{Px) = (f\Px\f) = -mwA sin (cut + 3) 
in accord with Ehrenfest’s theorem, (6.33) and (6.34). 

7.10. A small cylindrical tube is drilled through the Earth, passing through the center. 
A mass m is released essentially at rest at the surface. Assuming the density of the 
Earth is uniform, show that the mass executes simple harmonic motion and determine 
the frequency to. Determine the approximate quantum number n for this state of the 


Page 293 (metric system) 



278 | 7. The One-Dimensional Harmonic Oscillator 


mass, using a typical macroscopic value for the magnitude of the mass m. Explain 
why a single quantum number n is inadequate to specify the state. 

7.11. Prove that the parity operator n is Hermitian. 

7.12. Substitute t/r (x ) = Ne~~ ax ~ into the position-space energy eigenvalue equation 
(7.95) and determine the value of the constant a that makes this function an eigen¬ 
function. What is the corresponding energy eigenvalue? 

7.13. Calculate the probability that a particle in the ground state of the harmonic 
oscillator is located in a classically disallowed region, namely, where V(x) > E. 
Obtain a numerical value for the probability. Suggestion: Express your integral in 
terms of a dimensionless variable and compare with the tabulated values of the error 
function. 

7.14. As shown in Section 7.1, for small oscillations a pendulum behaves as a simple 
harmonic oscillator. Suppose that a particle of mass m is in the ground state of a 
pendulum of length L and that instantaneously the length of the pendulum increases 
from L to 4 L. What is the probability the particle will be found to be in the ground 
state of this new oscillator? Give a numerical answer. 

7.15. Follow the procedure outlined in Example 7.5 to determine the 4th and 5th 
Hermite polynomials, corresponding to n = 3 and n — 4, respectively. 


7.16. The coherent state \a) is an eigenket of the lowering operator: 

a\a) = a\a) 

Investigate whether it is possible to construct an eigenket of the raising operator a?. 

7.17. We can safely say that the coherent state is as close to purely classical oscil¬ 
latory motion as wc are going to get in quantum mechanics. An interesting limiting 
case is worthy of mention. Show that as fi ->• 0 the probability density 

| lx 10 ) 1 2 = 

V ith 

for the ground state becomes a Dirac delta function (see Appendix C). What happens 
to the momentum and position uncertainties for the coherent state in this limit? Is 
your result in accord with the uncertainty relation Ax A p x = hj 2? Explain. 


7.18. Verify that the expectation values 


(a'(t)|x|a(t)) 



2\a. \ cos(o)t + 8) 
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and 


(a(t)\p x \a(t)) = — 2\a \ sin(<yf + <5) 

for the coherent state |a(f)), where a = \a\e~ ,s t satisfy the Ehrenfest relations 

d(x} (pj 


dt 


and 


7.19. Prove that 


d(Px 

dt 


m 

_dV_ 

dx 


e A +B _ e A e B e -[A,B)/2 


when the operators A and B each commute with their commutator [A, B], that is, 
[A, [B, A]] = 0 and [B, [B, AJJ = 0. 

(a) First use induction to show that 

[B, A n ] = nA n ~ l [B , A] 

Recall from Problem 3.1 that in general 

[A, BC] — B[A, C] + [A, B]C 


(b) Use the result of (a) to show that 

[B, F(A)] = F'(A)[B, A] 

where F'(x) = dF/dx. Suggestion: Think of F(x) in terms of a Taylor series 
expansion. 

(c) Define f(X) = e x ^e x ®. Show that 

df 


dX 


(A + B+X[A,B])f(X) 


Finally, integrate this equation to obtain 

= gMA+B) g[A,B]k-12 

7.20. Show that 


°o 

D(«)|0> = e «“ ] e ~a*a e \aaX a *aV2 |q> _ _£!L |„) - | a ) 


77=0 
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7.21. Verify that 


(x) = d cos cot 

for the coherent state T u/)|0) that is generated by translating the ground state |0) of 
the harmonic oscillator by a distance d. 

7.22. Calculate (E) and the uncertainty A E for the coherent state |a). 

7.23. Evaluate |(/S|ct!)|" for coherent states |a) and \f3). Show that the states |a) and 

\fi) become approximately orthogonal in the limit |ck — $| 1 . 

7.24. The eigenstate of a with eigenvalue a is the coherent state |a). This state is 
a minimum uncertainty state. The ground state of the harmonic oscillator is also a 
minimum uncertainty state. Is the ground state a coherent state? If so, what is the 
corresponding eigenvalue a? Is the time evolution of the ground state consistent with 
(7.82)? 
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Path Integrals 


Our discussion of time evolution has emphasized the importance of the Hami ltonian 
as the generator of time translations. In the 1940s R. P. Feynman discovered a way to 
express quantum dynamics in terms of the Lagrangian instead of the Hamiltonian. 
His path-integral formulation of quantum mechanics provides us with a great deal 
of insight into quantum dynamics, which alone makes it worthy of study. The com¬ 
putational complexity of using this formulation for most problems in nonrelativistic 
quantum mechanics is sufficiently high, however, that the path-integral method re¬ 
mained something of a curiosity until more recently, when it was realized that it also 
provides an excellent approach to quantizing a relativistic system with an infinite 
number of degrees of freedom, a quantum field. 


8.1 The Multislit, Multiscreen Experiment 


We can get the spirit of the path-integral approach to quantum mechanics by con¬ 
sidering a straightforward extensionJbf the double-slit experiment. Recall that the 
interference pattern in the double-slit experiment, shown in Fig. 8.1, can be un¬ 
derstood as a probability distribution with the probability density at a point on the 
detecting screen arising from the superposition of two amplitudes, one for the par¬ 
ticle to reach the point going through one of the slits and the other for the particle 
to reach the point going through the other slit. Suppose we increase the number of 
slits from two to three. Then there will be three amplitudes (see Fig. 8.2a) that we 
must add together to determine the probability amplitude that the particle reaches 
a particular point on the detecting screen. Suppose we next insert another opaque 
screen with two slits behind the initial screen (Fig. 8.2b). Now there are six possible 
paths that the particle can take to reach a point on the detecting screen; thus we must 
add six amplitudes together to obtain the total amplitude. One can imagine filling 
up the space between the source and the detecting screen with an infinite series of 

281 
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Figure 8.1 The two paths in the double¬ 
slit experiment. The amplitudes for these 
paths add together to produce an interference 
pattern on a distant detecting screen. 



(a) 



Figure 8.2 (a) The three paths for a triple¬ 
slit experiment, (b) Three of the six paths 
that a particle may follow to reach a partic¬ 
ular point on the detecting screen when an 
additional screen with two slits is inserted. 


opaque screens and then eliminating these screens with an infinite number of slits 
in each screen. In this way, we see that the probability amplitude for the particle to 
arrive at a point on the detecting screen with no barriers in between the source and 
the detector must be the sum of the amplitudes for the particle to take every path 
between the source and the detection point. 


8.2 The Transition Amplitude 


We are now ready to see how we use quantum mechanics to evaluate the amplitude 
to take a particular path and how we add these amplitudes together to form a path 
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integral. 1 In this chapter we will concentrate on a one-dimensional formulation of 
the path-integral formalism. The extension to three dimensions is straightforward. 

We start with the amplitude {x ', t'\x 0 , t 0 ) for a particle that is at position x 0 at 
time t 0 to be at the position x' at time t'. In Chapter 4 when we introduced the subject 
of time evolution, we chose to set our clocks so that the initial state of the particle 
was specified at t = 0 and then considered the evolution for a time t. Here we are 
calling the initial time t 0 and considering the evolution for a positive time interval 
t' — f 0 . Thus the transition amplitude is given by 

(*', t' |JC 0 , t 0 ) = (x'\ U{t' - f 0 )|x 0 > = {x'^V-M^xq) (8.1) 

where U (t' — t 0 ) is the usual time-evolution operator and the Hamiltonian, which 
is assumed to be time-independent, is in general a function of the position and 
momentum operators: H — II (p x , x ). Of course, in the usual one-dimensional case 


H = ^- + V(x) (8.2) 

2m 

Once we know the amplitude (8.1), we can use it to determine how any state | xj/) 
evolves with time, since we can write the state |^r) as a superposition of position 
eigenstates: 


(x'\xlf(t')) = (x'\e^^ (t '- to)/n \ir(t 0 )) 



dx 0 (x'\e '^ (f ' ,0>/n \x 0 )(x 0 \f(t 0 )) 
dx o {x\ t'\x 0 , t 0 ){x 0 \f(t 0 )) 


(8.3) 


The amplitude (x\ t’\x Q , r 0 ), which appears within the integral in (8.3), is often 
referred to in wave mechanics as the propagator; according to (8.3) we can use 
it to determine how an arbitrary state propagates in time. 


1 Our approach is not that initially followed by Feynman, who essentially postulated (8.28) in 
an independent formulation of quantum mechanics and then showed that it implied the Schrodinger 
equation. Here, we start with the known form for the time-development operator in terms of the 
Hamiltonian and from it derive (8.28), subject to certain conditions on the form of the Hamiltonian. 
For a discussion of Feynman’s approach, see R. R Feynman and A. R. Hibbs, Path Integrals and 
Quantum Mechanics, McGraw-Hill, New York, 1965. For Feynman’s account of how he was 
influenced by Dirac’s work on this subject, see Nobel Lectures — Physics, vol. Ill, Elsevier, New 
York, 1972. For a very nice physical introduction to path integrals, see R. P. Feynman, QED: The 
Strange Theory of Light and Matter, Princeton University Press, Princeton, NJ, 1985. 
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As an example, let’s evaluate the propagator for a free particle using our earlier 
formalism. The Hamiltonian for a free particle is given by 


H = 


z! 

2m 


Inserting a complete set of momentum states 

/ OO 

dp \p){p\ = 1 

-OO 

in (8.1), we obtain 

f°° ,*» , 

(x\ t '|* 0 . t 0 ) = / dp [x'\e ,p * {r ~ r ° )/2mh \p)(p\x 0 ) 
J — OO 

/ °° 7 , 

dp {x , \p)(p\x 0 )e~ ,p ' (, - tQ)/2mh 

-OO 

Using 


Wp) 


Jpx/h 


y/lnh 


we see that 


{x, f'U 0 


’ U)) 2t riif-t 


dp e' 




(8.4) 


(8.5) 


( 8 . 6 ) 


(8.7) 


( 8 . 8 ) 


This is a Gaussian integral, which can be evaluated using (D.7): 


<*', ?W to) = (8 . 9 ) 

V 2nru(t — / 0 ) 

Problem 8.1 illustrates how we can use this expression for the propagator to deter¬ 
mine how a Gaussian wave packet for a free particle evolves in time. 


8.3 Evaluating the Transition Amplitude for Short 
Time Intervals 


In order to evaluate the transition amplitude [x\ t'\x 0 , t 0 ) for the interacting case for 
a finite period of time using the path-integral formalism, we first break up the time 
interval t' — t 0 into N intervals, each of size At = (/' — t 0 )/N. We will eventually 
let N oo so that At —> 0. Thus we are interested first in evaluating the transition 
amplitude for very small time intervals. In this limit we can expand the exponential 
in the time-evolution operator in a Taylor series: 

e ~niAt/h _ | _ Lfj(p x)At + O(Ar) (8.10) 

n 
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where the expression ()(At 2 ) includes the A t 2 and higher powers of At terms. If we 
now evaluate the amplitude for a particle at * to be at x' a time At later, we obtain 


(x'\e- itiAl/h \x) = (x'\ 


(x'\ 


1 - - H(p x , l)A t 
fi 


\x) + O(Ar) 


r 


\x) + 0(Ar) (8.11) 


It is easy to evaluate the action of V (x) since the ket in (8.11) is an eigenstate of the 
position operator and therefore 


V i.v)|.v) = V (jc) | jc) 


( 8 . 12 ) 


In order to evaluate the action of the kinetic energy operator, it is convenient to insert 
the complete set of momentum states (8.5) between the bra vector and the operator 
in (8.11) and then take advantage of 


(P\P X = ( P\P 


(8.13) 


In this way we obtain 


/ OO 

dp W\pHp\ 

-oo 


£ 


oo 

oo 


1- 


r £l 

2m 


V (x) 


At |t) + 0{At z ) 


dp t/\p)(p\ 


1- E(p, x)At 

h 


\x) + 0{Ar) (8.14) 


where 


E{p. x) =fe-—-(- V(jc) 

2m 


We now take advantage of (8.10) in reverse to write 


(8.15) 


1 - - E(p , x)At = + 0 (At 2 ) 

Pi 


(8.16) 


Thus the transition amplitude (8.14) becomes 


/| -ifiAt/h 


(x \e 


x) 


/ OO 

dp {x'\p){p\x)e- iE{ P' x)Al l h + 0(Ar 2 ) 

-OO 

1 f 00 

= —/ dpe' p(x '- x)/n e~ iE(p - x)A,/h + 0(At 2 ) (8.17) 
2nhJ-oo 
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or simply 


fi — i H At / h 


(x\e 


x) 


r d p i 

J-o o 2 nk CXP | - 


(x - X) 

P —-- E(p, X ) 

At 


At \ + O(Ato) 

(8.18) 


Equation (8.18) is deceptively simple in appearance. Although we characterized 
(8.16) as (8.10) in reverse, the exponential (8.10) contains the Hamiltonian operator, 
while the exponential (8.16) involves no operators at all. Where have the operators 
gone? The answer is that we have avoided much of the complexity of having to deal 
with the exponential of an operator by retaining just the terms through first order in 
At in (8.11). These complications are absorbed in the 0(At 2 ) term in (8.14). For 
example, if we were to try to calculate the At 2 term in (8.14), we would see that the 
fact that the position and momentum operators in the Hamiltonian do not commute 
prevents our replacing both these operators with ordinary numbers by inserting just a 
single complete set of momentum states. But if we consider the limit of the transition 
amplitude (8.18) as At —>• 0, we can ignore these 0(At 2 ) complications. We will 
next see, however, that there is a penalty to pay for formulating quantum mechanics 
in a way that eliminates the operators that have been characteristic of our treatment 
of time development using the Hamiltonian formalism. 


8.4 The Path Integral 


We are now ready to evaluate the transition amplitude ( x' , / j.v 0 , t 0 ) for a finite time 
interval. As we suggested earlier, we break up the interval t' — t 0 into N equal-time 
intervals At with intermediate times t t , t 2 , ■ ■ ., t N _ ,, as shown in Fig. 8.3. Therefore 


(x\ t'\x 0 , to) = (JC'I | X() ) 

N times 

We next insert complete sets of position states 



\Xj)(Xi\ = 1 


i = 1, 2, . .., N — 1 


(8.19) 


( 8 . 20 ) 


between each of these individual time-evolution operators: 

(x\ tW<o) = j dx x --‘j dx N _ x (x'\e~ ,HA,/n \x N _ l ){x N _ l \e~ lHAt/h \x N _ 2 ) ■ ■ ■ 
x {x 2 \e- iftK,lh \x x ){x x \e~ iiiAtlh \xo) (8-21) 


[-*— At —-»j-«— At —»j — At —»-] 

_I_I_I_1_I_I_I_L 

to t\ t 2 to- i t' 

Figure 8.3 The interval t' — r 0 is broken into N time intervals, each of length At. 
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x 



Figure 8.4 A possible path taken by the particle in going from position x 0 at time t 0 to 
a position x' at time t, with intermediate positions ,r, at time t x . x 2 at time f 2 , and so on. 


where each of the integrals is understood to run from —oo to oo, as indicated in 

(8.20) . Reading this equation from right to left, we see the amplitude for the particle 
at position x 0 at time r 0 to be at position x x at time t x = t 0 + At, multiplied by the 
amplitude for a particle at position x x at time t 0 + At to be at position x 2 at time 
t 2 = t 0 + 2At. This sequence concludes with the amplitude for the particle to be 
at x' at time t' when it is at position x N _ x at a time At earlier. Figure 8.4 shows 
a typical path in the x-t plane for particular values of x x , x 2 , . .., x N _ x . Note that 
we are integrating over all values of x x , x 2 , .. ., x N _\ in (8.21). Thus, as we let 
At -> 0, we are effectively integrating over all possible paths that the particle can 
take in reaching the position x ' at time t' when it starts at the position x 0 at time t 0 . 

We now use the expression (8.18) for the N amplitudes (x i+] \e~ lHAl,lh \x i ) in 

(8.21) , provided we are careful to insert the appropriate values for the initial and 
final positions in each case. If we let N -U- oo, and correspondingly At —> 0, we can 
ignore the 0(At 2 ) term in each of the individual amplitudes, and the expression for 
the full transition amplitude is exactly given by 


(x, t'\x 0 , r 0 ) = Urn 

N—±oo 


j dx x - ■ ■ J dx N _ { j 


dp i 


i 


dp N 
2nh 


exp 




h l 


i=i 


2ith 

{Xj — 

At 


E{Pi, x, .,) 


At (8.22) 


where we have called the final position x' — x N in the exponent. 

We now face a task that appears rather daunting: evaluating an infinite number 
of integrals. In fact, (8.22) involves both an infinite number of momentum and an 
infinite number of position integrals. Fortunately, for a Hamiltonian of the form (8.2), 
each of the momentum integrals is a Gaussian integral, which can be evaluated using 
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(D.7) [with a = iAt/2mh and b = i (x, — x t . y)/h], A typical momentum integral is 

given by 


[ ^El 

J 2nh 


exp 


■ Pi At , -PMi 

-i -- + t- 


2 nth 


h 


m 


exp 


V In hi At 
After doing all of the p integrals, we find 


i_ (mAt \ (Xj - x t _! 

h 


lim 

1 dx \••• 

N->oo J 



f ; N 

x exp • 

-a tjr 

* s 


At 


m \ 


(8.23) 


N/2 


m / Xj - x,_ i 
2 V At 


JjthiAt, 

2 


V(x,_ i) 


(8.24) 


Notice that as N —> oo and therefore Ar —> 0, the argument of the exponent becomes 
the standard definition of a Riemannian integral: 


N 


lim L At y\ m l x ‘~ x ^ 

yv-»oo,Ar—<•()/), ^—' 

i =1 


At 


Vix.-y) 


= L f' 

ft Jt,\ 


dtL(x,x) (8.25) 


where 


T .. m (dx \ 
L(Xy X) = — I - 

2 V dt) 


1 0 

F(x) = -mx 2 ~ V (*) 


(8.26) 


is the usual Lagrangian familiar from classical mechanics. 2 

Finally, it is convenient to express the remaining infinite number of position 
integrals using the shorthand notation 

£ drsf 2 < 8 - 27 > 

which is a symbolic way of indicating that we are integrating over all paths connect¬ 
ing x 0 to x'. Then 


where 


(x\ t’\x 0 , t 0 ) = r Z>[jc(f)]e ist * (,)1/s 
Jx {) 


(8.28) 


5[x(r)] = 



t/f L(x, x) 


(8.29) 


If you are not familiar with the Lagrangian and the principle of least action, a brief but 
entertaining introduction is given in The Feynman Lectures on Physics , Vol. II, Chap. 19. 
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is the value of the action evaluated for a particular path taken by the particle. An 
integral such as (8.28) is referred to as a functional integral. In summing over all 
possible paths, we are really integrating over all possible functions x(t) that meet 
the boundary conditions x(t 0 ) = x 0 and x(t') = x’. 

In summary, in order to determine the amplitude for a particle at position ,v 0 at 
time t 0 to be found at position x' at time we consider all paths in the x-t plane 
connecting the two points. For each path x(t), we evaluate the action S[x(f)]. Each 
path makes a contribution proportional to g'SbdOl/^ a factor that has unit modulus 
and depends on the path only through the phase S\x(t)\/fi. We then add up the 
contribution of each path. Note that in a formulation of quantum mechanics that 
starts with (8.28), operators need not be introduced at all. However, we must then face 
the issue of actually evaluating the path integral in order to determine the transition 
amplitude, or propagator. To give us some confidence that this is indeed feasible, at 
least in some cases, we first reconsider the evaluation of the transition amplitude (8.9) 
for a free particle, this time with the path-integral formalism. Then, in Section 8.6, we 
will use the path-integral formulation to examine the relationship between quantum 
and classical mechanics. 

8.5 Evaluation of the Path Integral for a Free Particle 


In order to evaluate the path integral (8.28) for a free particle, for which V(x ) = 0, we 
retrace our derivation of (8.28) and break up the time interval t' — ? 0 into N discrete 
A t intervals: 


(x\ t'\x 0 , t 0 ) = 




We introduce the dimensionless variables 



(8.30) 




2llAt 


(8.31) 


where again x N = x 1 . Expressed in terms of these variables, the transition amplitude 
becomes 


.. ( m 


yv-*oo 


L 


x / dyjv-jexp 


'X! (y > ~ 


(8.32) 


L i=1 


Note that we have explicitly inserted the limits of integration. 
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Let’s start with the y, integral, leaving aside for the moment the constants in 
front: 

f dy { e i[ (w-ri^+O-i-ro) 2 ! 

J ~ 00 

where we have taken advantage of (D.7). Fortunately, evaluating this integral has 
left us with another Gaussian. We are thus able to tackle the y 2 integral, again with 
the aid of (D.7): 


I l jL. e Hyr~yof/t 

2 


(8.33) 



dyo e iliy i- y2}2+(y2 - yo)2/2] 


^ e i(y 3 -y^/3 



gHyr-yi o > 2 / 3 


(8.34) 


A comparison of the result of the yq integral (8.33) with the result of having done both 
the v : ] and the y 2 integrals in (8.34) suggests that the result of (N — 1) y integrals is 
just 




~ yi ~^ 2 


L i —1 



e HyN-yo) 2 /H 


(8.35) 


which can be established by induction. See Problem 8.2. Thus 


(x\ t j.r 0 , to) — bm 


m 


n^oo \ 2nhiAt 




N 


lim 1( 
n-+o oV 2nhiNAt 


,n xq)~ /2HN At 


m _ im(x' ~x 0 ) 2 /2h(t'-ty) 


V 2 7THi(t' — t 0 ) 


(8.36) 


where we have used t' — to — NAt in the last step. This result is the same as 
we obtained with considerably less effort in Section 8.2 using the Hamiltonian 
formalism. 

There is a limited class of problems with Lagrangians of the form 


L(x, x) =a + bx + cx 2 + dx 4- exx + fx 2 (8.37) 


where the integrals in the path-integral formulation are all Gaussian and the pro¬ 
cedure we have outlined for the free particle can also be applied to determine the 
transition amplitude. In general, this is a fairly cumbersome procedure, but there are 
some shortcuts that can be used to determine the amplitude in these cases. The in¬ 
terested reader is urged to consult Feynman and Hibbs, Path Integrals and Quantum 
Mechanics. The main utility of the path-integral approach in nonrelativistic quantum 
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mechanics is not, as you can probably believe, in explicitly determining the transi¬ 
tion amplitude but in the alternative way it gives us of viewing time evolution in 
quantum mechanics and in the insight it gives us into the classical limit of quantum 
mechanics. 


8.6 Why Some Particles Follow the Path of Least Action 


Equation (8.28) is an amazing result. Not only does every path contribute to the 
amplitude, but each path makes a contribution of the same magnitude. The only 
thing that varies from one path to the next is the value of the phase S\x(t)/h\ 
Since quantum mechanics applies to all particles, why then, for example, does a 
macroscopic particle seem to follow a particular path at all? 

In order to see which paths “count,” let’s consider an example. Suppose that at 
t = 0 a free particle of mass m is at the origin, x = 0, and that we are interested 
in the amplitude for the particle to be at x = x' when / = t'. There are clearly an 
infinite number of possible paths between the initial and the final point. One such 
path, indicated in Fig. 8.5, is 





(8.38) 


This is, of course, the path that a classical particle with no forces acting on it and 
moving at a constant speed v = x'/t’ would follow. For this path, x = x'/t' and 
therefore L = mx 2 /2 = mx a )2t a . Consequently 

r, f 1 , m (x’ 2 \ mx' 2 

= ,8.39) 



Figure 8.5 Two paths connecting the initial position 
v — 0 at t — 0 and the final position x at time the 
classical path for a free particle, x d = (x’ji')I, and the 
path x = ( x'/r !l )t 2 . 


Page 307 (metric system) 
















292 I 8. Path Integrals 


If we evaluate the phase S d /h for typical macroscopic parameters such as m = 1 g, 
x = 1 cm, and t' = 1 s, we find that the phase has the very large value of roughly 
(1/2) x 10 27 radians. 

We also choose another path, which is also depicted in Fig. 8.5, namely, 


x = 



(8.40) 


This path, which is characteristic of a particle undergoing uniform acceleration, is 
clearly not the classical path for a particle without any forces acting on it. For this 
path we find L = mx 1 /2 = lmx' 2 t 2 / 1 14 and therefore 



(8.41) 


The value of the phase is roughly (2/3) x 10 2 ' radians for the same macroscopic 
parameters. 

Although the phases determined from (8.39) and (8.41) are different, what really 
distinguishes the classical path from any other is not the actual value of the action 
itself. Rather, the classical path is the path of least action, or, more precisely, the 
one for which the action is an extremum. To illustrate this explicitly, we consider a 
set of paths in the neighborhood of the two paths that we are using as examples. In 
the vicinity of the classical path, we take the set of paths 


x 


x 

T 7 


t + £ 


Hf-t') 

t' 


(8.42) 


where each value of the parameter s labels a different path that deviates slightly 
from the classical path if s is small. Notice that x(t) still satisfies the initial and final 
conditions x(0) = 0 and.r(F) =x\ respectively. It is straightforward to calculate the 
action: 



The important thing to notice is that the change in the action depends on £ 2 ; there is 
no term linear in e. The action is indeed a minimum; varying the path away from the 
classical path only increases the action from its value (8.39). Because the first-order 
contribution to the action vanishes: 


SS = 



£ — 0 

£=0 


(8.44) 
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the contribution through first order of each of the paths to the path integral is 
proportional to 


e i(S ci +SS)/h _ e iS c i/h 


(8.45) 


Thus the amplitudes for the paths in the vicinity of the classical path will have 
roughly the same phase as does the classical path and will, therefore, add together 
constructively. 

If, on the other hand, we consider the nonclassical path (8.40), we can also 
determine the action for a set of paths in its neighborhood, 


x = 


t + £ 




n2 


(8.46) 


which again satisfy x(0) = 0 and x(t') — x'. If we now calculate the action for (8.46), 
we obtain 


s =¥ ! ( i+ i + -)= 5 ( rf/ '' 2 )( i+ l + '--) (8 - 47 ' 

Here, in agreement with the principle of least action, the first-order correction 
SS — (dS/de) e=0 £ 7 ^ 0. Some neighboring paths, in this case those with e < 0, 
reduce the value of the action from its value for the path (8.40). The contribution 
through first order of the paths in the vicinity of the path (8.40) to the path integral is 
e i(S+8S)/h . Th USj j n general, paths in the neighborhood of the nonclassical path may 
be out of phase with each other and may interfere destructively. 

A useful pictorial way to show how this cancellation arises is in terms of phasors. 
For convenience, let’s assume that we can label the paths discretely instead of 
continuously. When we add the complex numbers 

e isix } (t)yn _ CQS six^m/h + i sin S[x y (t)]/h (8.48a) 


and 


e iS[x 2 (t)yn _ CQS s[x 2 (t)]/h + i sin S[x 2 (t)]/h (8.48b) 

together for two paths, we just add the real parts and the imaginary parts together 
separately. The magnitude of this complex number is of course given by 

e iS[x\(t)\/h e iS[x 2 (t)\/h 

0 1 o\ 1 /2 

cos S[xi(t)]/h + cos S[x 2 (t)]/ti\ + {sin S[xi{t)]/h + sin S[x 2 (t)]/hYj 

(8.49) 


Page 309 (metric system) 



294 | 8. Path Integrals 



Figure 8.6 The addition of the 
amplitudes arK j 

is carried out using phasors. Each of 
the amplitudes is represented by an 
arrow of unit length in the complex 
plane, with an orientation angle, or 
phase angle, S\x(t)]/h. The rule for 
gisth adding the two amplitudes is the 
same as for ordinary vector addition. 


We can recognize this as the same procedure we would use to find the length of an 
ordinary vector resulting from the addition of two vectors, V = Vj + V 2 , namely, 

|V| = [(v lx + v 2x ) 2 + (v ]y + V 2v ) 2 ] 1/2 (8.50) 

Thus, if we indicate the complex amplitudes (8.48) by vectors in the complex plane, 
with the real part of the amplitude plotted along one axis and the imaginary part of 
the amplitude plotted along the other axis, the complex number resulting from the 
addition of the two amplitudes (8.48a) and (8.48b) is just the vector sum. as shown 
in Fig. 8.6. 

What happens as we add up the contributions of the nonclassical path (8.40) 
and its neighbors? Notice that the first-order change in the action from (8.47) is 
proportional to the value S of the action itself multiplied by s. As s changes away 
from zero, the phase of the neighboring path changes. In the particular case (8.46), 
we see that when sS(x't 2 /t a )/2h = 2n , the phase has returned to its initial value, 
modulo 2 tt. Thus if S/h is 10 27 for some typical macroscopic parameters, e need 
only reach the value s = 4jt x 10~ 27 to satisfy this condition. In Fig. 8.7a we 
add up the arrows for a discrete set of paths representing those between e = 0 
and s — An x 10“ 27 . These arrows form a closed “circle” and therefore sum to 
zero. Thus, the contributions from these paths cancel each other and hence do not 
contribute to the path integral (8.28). On the other hand, in the vicinity of the classical 
path (8.38), the first-order contribution to the action vanishes and thus the paths in 
the vicinity of the classical path have the same phase and add together constructively 
(Fig. 8.7b). This coherence will eventually break down, when the phase shift due to 
nearby paths reaches a value on the order of re. For our specific example (8.43), this 
means S ci s 2 /3h ~ n, or s — lO -13 for the macroscopic parameters. This is clearly 
a very tight constraint for a macroscopic particle, since the paths that count do not 
deviate far from the path of least action. But the classical path is still important 
because only in its vicinity can many paths contribute to the path integral coherently. 
In the neighborhood of any other path, the contributions of neighboring paths cancel 
each other (see Fig. 8.8). Quantum mechanics thus allows us to understand how a 
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Figure 8.7 (a) The sum of a discrete set of amplitudes representing those in the vicinity 
of the nonclassical path. Since these arrows form a closed “circle,” their sum vanishes, 
(b) In the vicinity of the classical path, the amplitudes, which all have the same phase to 
first order, sum to give a nonzero contribution to the path integral. 


particle knows to take the path of least action, at least in classical physics: the particle 
actually has an amplitude to take all paths. 

Our numerical examples in this section so far have been entirely about a macro¬ 
scopic particle. What happens if we replace the 1 g mass with an electron? Notice 
that the phase difference between the two paths in Fig. 8.5 is given by 

AS S(x't 2 /t' 2 ) - S d mx a 

- ± = — (8.51) 

n h 6th 

While for m = 1 g with x' = 1 cm and t' = 1 s this phase difference is about (1/6) x 
I0 27 radians, the phase difference between the two paths for the electron, for which 
m ~ 1()“ 27 g, is only | radian. Thus for an electron even the path jc = x't 2 /t a is 
essentially coherent with the classical path x = x'tjt'. Because there are many more 
paths that can contribute coherently to the path integral for the electron than there 
are for the macroscopic particle, the motion of the electron in this case should be 
extremely nonclassical in nature. j» 

This last example with the electron is sufficient to cause us to wonder again about 
the double-slit experiment. Why do we see a clear interference pattern arising from 


I m e iS/fl 


Re e m 



Figure 8.8 A schematic diagram us¬ 
ing phasors showing for a macroscopic 
particle how the classical path and its 
neighbors dominate the path integral, 
while other paths give no net contribu¬ 
tion as they and their neighbors cancel 
each other. 
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Figure 8.9 A path that does not contribute coherently to the 
double-slit experiment illustrated in Fig. 8.1. 


the interference of the amplitudes to take just the two paths shown in Fig. 8.1? Why 
don’t other paths, such as the one indicated in Fig. 8.9, contribute? The answer is 
that the action for the paths indicated in Fig. 8.1 is actually much larger than the 
previous example might lead you to think. For example, electrons with 50 eV of 
kinetic energy, a typical value for electron diffraction experiments, have a speed of 
4 x 10 8 cm/s. Thus if we take x' — 40 cm as a typical size scale for the double-slit 
experiment and t' — 10~ 7 s so that the speed has the proper magnitude, we find the 
phase S/K for the straight-line path in Fig. 8.5 to be 7 x 10 9 . When the phase is 
this large, only a small deviation away from the classical path will cause coherence 
to be lost. The large size of the action is also the reason that we can use classical 
physics to aim an electron gun in a cathode ray tube, where the electrons may have 
an energy of 5 keV, or to describe the motion of atoms through the magnets in the 
Stem-Gerlach experiments in Chapter 1. 


EXAMPLE 8.1 Figure 8.1 shows the two paths that are used to analyze the 
double-slit experiment. In Example 6.5, for example, the locations of the 
maxima for a double-slit experiment carried out with monoenergetic helium 
atoms are determined by the requirement that the difference in path lengths 
between two straightline paths between the source and the detector is an 
integral number of wavelengths. For this experiment, explain why it is safe 
to ignore the curvy path shown in Fig. 8.9. 

SOLUTION According to (8.39), the action S for a free particle of mass m 
traversing a distance x' in time t' is 

,2 

S _ mX— 

21 ' 
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In the helium atom experiment, the distance traversed on a typical straightline 
path is on the order of a few meters and the time of flight is measured in 
milliseconds. We can plug these numbers into the action and evaluate S/tl. 
Alternatively, we can note that 

nix' h 

i - — mv = p = — 

t' k 

Thus 

S px' itx' 

n ~ in ~ k 

Since in the experiment k — 45 x 10~ 12 m and, say, x' — 2 m, then 

S = (n )(2 m) _ 10 n 

h 45 x 10 -12 m 

Since S/h 3> 1, the amplitude for a helium atom to travel between the source 
and the detector in the path integral is dominated by the paths in the very near 
vicinity of the paths of least action, namely the paths shown in Fig. 8.1. 


8.7 Quantum Interference Due to Gravity 


We now show how we can use path integrals to analyze a striking experiment 
illustrating the sensitivity of the neutron interferometer that we first introduced in 
Section 4.3. An essentially monochromatic beam of thermal neutrons is split by 
Bragg reflection by a perfect slab of silicon crystal at A. One of the beams follows 
path ABD and the other follows path ACD, as shown in Fig. 8.10. In general, there 
will be constructive or destructive interference at D depending on the path difference 
between these two paths. Suppose thatMie interferometer initially lies in a horizontal 
plane so that there are no gravitational effects. We then rotate the plane formed by 
the two paths by angle 8 about the segment AC. The segment BD is now higher than 
the segment AC by / 2 sin 8. Thus there will be an additional gravitational potential 



Figure 8.10 A schematic of the neutron inter¬ 
ferometer. The interferometer, initially lying in a 
horizontal plane, can be rotated vertically about 
the axis AC by an angle S. 
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Figure 8.11 The interference pattern as a function of the 
angle 5. Adapted from J.-L. Staudennrann, S. A. Werner, R. 
Colella, and A. W. Overhauser, Phys. Rev. A21, 1419 (1980). 


energy mgl 2 sin 5 along this path, which alters the action and hence the amplitude 
to take the path BD by the factor 

e ~i(mgl 2 sin S)T/h (8.52) 

where the action in the exponent is the negative of the potential energy multiplied 
by the time T it takes for the neutron to traverse the segment BD. Of course, gravity 
also affects the action in traversing the segment AB, but this phase shift is the same 
as for the segment CD, and thus the phase difference between the path ABD and the 
path ACD is given by 


S[ABD] - 5[ACD] _ mgl 2 T sin 8 
h ~ h 

m 2 gl 2 l i sin <5 
hp 

m 2 gl 2 l jA. sin 5 
2nti 2 


(8.53) 


where we have used the de Broglie relation p = hj'k to express this phase difference 
in terms of the wavelength of the neutrons. Figure 8.11 shows the interference fringes 
that are produced as <5 varies from -45° (BD below AC) to +45° (BD above AC) 
for neutrons with X— 1.419 A. The contrast of the interference pattern dies out with 
increasing angle of rotation because the interferometer bends and warps slightly (on 
the scale of angstroms) under its own weight as it is rotated about the axis AC. 

Notice in the classical limit that as H —y 0, the spacing between the fringes in 
(8.53) becomes so small that the interference pattern effectively washes out. This 
interference is, in fact, the only gravitational effect that depends in a nontrivial 
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way on quantum mechanics that has so far been observed. 3 Now, not surprisingly, 
neutrons are observed to “fall” in a gravitational field, 4 but from (6.33) and (6.34) 
we see for a gravitational field pointing in the negative x direction that 


d 2 (x) 

dt 2 


= ~g 


(8.54) 


which does not depend on Planck’s constant at all. Neither does (8.54) depend on 
the value of the mass m. This lack of dependence on m is a consequence of the 
equivalence of inertial mass m h which would appear on the left-hand side of (8.54) 
as the nijCi of Newton’s law, and the gravitational mass m g , which appears in the 
right-hand side in the gravitational force. 5 All bodies fall at the same rate because of 
this equivalence. While this equivalence has been well tested in the classical regime, 
our result (8.53), which when expressed in terms of m, and m g becomes 


S[ABD] - SfACDj _ m^iggl^ly), sin 8 
h 2ith 2 


(8.55) 


provides us with a test of the equivalence between inertial and gravitational mass at 
the quantum level. The determination of m,m g from (8.55) is in complete agreement 
with the determi nation of mj from mass spectroscopy. 


8.8 Summary 


The essence of Chapter 8 is contained in the expression 

f x ' 

(/. t'\x 0 , t 0 ) = / V[x (0] e lS[x(,m (8.56) 

J.x 0 

for the amplitude for a particle initially at position x 0 at time t 0 to be at position x' at 
later time t'. The right-hand side of (8,56) tells us that the amplitude is proportional 
to an integral of e lS ^ W)l/6 over a // p a (f ls r (/) connecting x 0 to x', subject to the 
constraint that x(t 0 ) — x 0 and x(t') = x', where 

5[x (r)J = f dtL(x,x) (8.57) 

J to 


3 On a microscopic scale, where most quantum effects are observed, gravitation is an extremely 
weak force. For example, the ratio of the electromagnetic and the gravitational forces between an 
electron and a proton is Gm e m p /e 2 = 4 x 10 -40 . 

4 A. W. McReynolds, Phys. Rev. 83, 172, 233 (1951): J. W. T. Dabbs, J. A. Harvey, D. Paya, 
and H. Horstmann, Phys. Rev. 139, B756 (1965). 

5 Near the surface of the Earth m K g = Gm^M/R 2 , where G is the gravitational constant and 
M is the mass and R the radius of the Earth. 
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Figure 8.12 (a) A single-slit diffraction experiment. The path shown with the dashed line 
is an example of a path that is obstructed by the impenetrable screen and therefore does not 
contribute to the integral over all paths, (b) A double-slit interference experiment, (c) A 
diffraction-grating experiment. 


is the value of the action evaluated for a particular path x(t). 

Although evaluating the path integral (8.56) is not especially practical in most 
problems, the path-integral approach does give us a useful way to think about 
quantum dynamics. For example, inserting an impenetrable screen with an aperture 
between a source of particles and a detector, as shown in Fig. 8.12a, eliminates many 
of the paths that the particle could follow in moving between the two points, altering 
the amplitude for the particle to arrive at the detector from what it would have been in 
the absence of the screen. We call this phenomenon diffraction. If a second aperture 
is opened in the impenetrable screen, as shown in Fig. 8.12b, the paths for the particle 
to reach the detector by traveling through this second slit must be added to the paths 
to reach the detector by traveling through the first slit, generating an interference 
pattern. In fact, if you have doubts about the role played by paths such as the one 
blocked by the screen in Fig. 8.12a, consider opening a periodic array of apertures 
in the screen to allow the particle following a special set of these paths to reach the 
detector, as shown in Fig. 8.12c. The pattern will clearly differ from that obtained 
with a single or a double slit. 

The path-integral approach also gives insight into the foundations of classical 
mechanics. Since the factor e lS l-W)]/fi j s a complex number of unit modulus, the 
only thing that differs from one path to another is the value of the phase S\x{t)yh. 
Figure 8.13 is a schematic diagram of the phase plotted as a function of the path x(t). 
The particular path where the action is an extremum —SS = (>—is often called the 
“path of least action.” This path of least action is the unique path x cl (f) that we expect 
a particle to follow in classical physics. In quantum mechanics, on the other hand, all 
paths contribute to the path integral (8.56). What makes x cl (f) special is that since 
it is the path for which the phase S\x(t)}/h is an extremum, the phase difference 
between the classical path and its neighbors changes less rapidly than for any other 
path and its neighbors. When we add up the contribution from all paths, only in the 
vicinity of the classical path do we find many paths that are in phase with each other 
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S/fi 



Jfcl t 


Path 


Figure 8.13 A schematic diagram of the phase 
S|*(r)]//i as a function of the path x(t). 


and hence can add together coherently. In situations where S[x{t)]/h 2> 1, such as 
for a macroscopic particle, this is a very tight constraint which indeed singles out 
the classical path and its very nearby neighbors. However, when S[x(t)}/h ~ 1, even 
paths that deviate significantly from the classical path can still be roughly in phase 
with it, and the behavior of the particle can no longer be adequately described by 
classical physics at all. 

Problems 


8.1. Use the free-particle propagator (8.9) in (8.3) to determine how the Gaussian 
position-space wave packet (6.59) evolves in time. Check your result by comparing 
with (6.76). 

8.2. Prove (8.35) by induction. 

8.3. Determine, up to an overall multiplicative function of time, the transition am¬ 
plitude, or propagator, for the harmonic oscillator using path integrals. See Feynman 
and Hibbs, Path Integrals and Quantum Mechanics, Sections 3.5 and 3.6. 

8.4. Estimate the size of the action for free neutrons with A. = 1.419 A traversing a 
distance of 10 cm. 

8.5. For which of the following does classical mechanics give an adequate descrip¬ 
tion of the motion? Explain. 

(a) An electron with a speed v/c = 1/137, which is typical in the ground state of 
the hydrogen atom, traversing a distance of 0.5 A, which is a characteristic 
size of the atom. 

(b) An electron with the same speed as in (a) traversing a distance of 1 cm. 

8.6. A low-intensity beam of charged particles, each with charge q , is split into two 
parts. Each part then enters a very long metallic tube shown in Fig. 8.14. Suppose that 
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Source 



Figure 8.14 A double-slit exper¬ 
iment with charged particles in 
which the particles traverse long 
metallic tubes. 


the length of the wave packet for each of the particles is sufficiently smaller than the 
length of the tube so that for a certain time interval, say from ? 0 to t', the wave packet 
for the particle is definitely within the tubes. During this time interval, a constant 
electric potential V x is applied to the upper tube and a constant electric potential V 2 
is applied to the lower tube. The rest of the time there is no voltage applied to the 
tubes. Determine how the interference pattern depends on the voltages V x and V 2 and 
explain physically why this dependence is completely incompatible with classical 
physics. 
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CHAPTER 9 


Translational and Rotational Symmetry 
in the Two-Body Problem 


After spending Chapters 6, 7, and 8 in one dimension, we now return to the three- 
dimensional world and consider a system consisting of two bodies that interact 
through a potential energy that depends only on the magnitude of the distance 
between them. The Hamiltonian for this system is invariant under translations and 
rotations of both of the bodies, which leads to conservation of total linear momentum 
and relative orbital angular momentum, respectively. The relationship between an 
invariance, or a symmetry, in the system and a corresponding conservation law is 
one of the most fundamental and important in physics. 

9.1 The Elements of Wave Mechanics in Three Dimensions 


Let’s begin by extending our discussion of wave mechanics in Sections 6.1 through 
6.5 to three dimensions. 1 The position eigenstate in three dimensions is given in 
Cartesian coordinates by 


|r) = \x, y, z) 

where 

v|r)=x|r> y|r} = y|r) z\x) = z\r) 

We express an arbitrary state |i jr) as a superposition of position states by 


W) 


-Iff 


dx dy dz \x, y, z){x, y, z\xfr) 


/ 


d r |r){r|i/r) 


(9.1) 


(9.2) 


(9.3) 


1 It would be good to review those sections of Chapter 6 before reading Section 9.1. 


303 


Page 319 (metric system) 



CHAPTER 9 


Translational and Rotational Symmetry 
in the Two-Body Problem 


9 


After spending Chapters 6, 7, and 8 in one dimension, we now return to the three- 
dimensional world and consider a system consisting of two bodies that interact 
through a potential energy that depends only on the magnitude of the distance 
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invariance, or a symmetry, in the system and a corresponding conservation law is 
one of the most fundamental and important in physics. 

9.1 The Elements of Wave Mechanics in Three Dimensions 


Let’s begin by extending our discussion of wave mechanics in Sections 6.1 through 
6.5 to three dimensions. 1 The position eigenstate in three dimensions is given in 
Cartesian coordinates by 


|r) = \x, y, z) 


where 


(9.1) 


i|r)=x|r) y|r)=y|r) z|r)=z|r) 

We express an arbitrary state \^) as a superposition of position states by 


l<A> 


-///* 


dx dy dz |x, y, z)(x, y, z\x(r) 


-I 


d 3 r |r) (r10-> 


(9.2) 


(9.3) 


1 It would be good to review those sections of Chapter 6 before reading Section 9.1. 
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a here the integrals run over all space. 11" we consider the special case where the state 
. = x’. v\ z'>, a position eigenstate, we see that 


(■X , V, z\x', y, z) = <$(.v -.r')i5(j - y')8{z - ; 

z) (9.4) 

or more compactly 


<r|r') = 5 3 (r-r') 

(9.5) 

The superscript on the Dirac delta function emphasizes that this is actually three 
delta functions. 

Using the normalization condition, we see that 

l = (\f/\\lr) = JJJ dx dv dz \(x, y, z\\//)\ 2 — j d*r 

l(r|y)| 2 (9.6) 

indicating that we should identify 


dxdydz\(x,y, z\f)\ 1 = d\\(rm 2 

(9.7) 

with the probability of finding a particle in the state \ijr) in the volume d 2 r at r if a 
measurement of the position of the particle is carried out. 

Just as we did in one dimension, we now introduce a three-dimensional transla¬ 
tion operator that satisfies 

T (« x i)|jr, y, z) = |x + u x . y, z) 

(9.8a) 

na y j)|*, y, z) = \x, y + a y , z) 

(9.8b) 

7’(a z k )k. V, z) - \x, y, z + a,) 

(9.8c) 

or. in short, 


f(a)|r) = |r + a> 

* (9.9) 

As in (6.26), these translation operators can be expressed in 
generators of translations p x , p y . and p z : 

terms of the three 

T(a x i) = 

(9.10a) 

f (fl -v j) = e-‘i’> u y /n 

(9.10b) 

f(fi.k) = e -'^ /H 

(9.10c) 
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y 
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m -• 


dy 


•-► • 

A a * C 


x 


Figure 9.1 Translations along different directions com¬ 
mute: the translation operator T(a x i)T (a v j), which is indi¬ 
cated by the path ABD, has the same effect as the translation 
operator f (a y j)T(a x i), which is indicated by the path ACD. 


In contrast to what we saw with rotations in Chapter 3, successive translations in 
different directions, such as in the x and y direction, clearly-commute with each 
other (see Fig. 9.1). Thus 


T (a y 3)f (a A .i) = f (a x i)T(a y j) 
If we substitute the series expansion 


f(a x \) = 1 - 


iPx“x 

n 


^2 2 
P X jc 

2 n 2 


+ ■■■ 


(9.M) 


(9.12) 


and the corresponding expression for T(a,j) into (9.11) andretain terms through sec¬ 
ond order, we can show that the generators of translations along different directions 
commute: 


[p X 'Pyl = 0 (9.13) 

See Problem 9.1. We can thus express the three-dimensional translation operator 
simply as 2 


f (a) — e -iPxO.JH e -iPya y /H e -iPz“z/^ — e ~i P-a/ft (9.14) 

As we saw in Chapter 6, the generator of translations in a particular direction 
does not commute w'ith the corresponding position operator. In three dimensions, 
this leads to the commutation relations 

[x, p x ] - ill [y, p y ] - ih f z, p z ] — ih (9.15) 


2 The product of two exponential operators can be replaced by the exponential of the sum of 
the two operators since the two operators commute. Sec Problem 7.19. 
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However, the generator of translations along an axis—for example, the x axis— 
does commute with the position operator along an orthogonal direction, say the y 
direction: 


T(a x i)y\i/f) = f (a x i)y jjj dx dy dz |x, y, z) (x, y, z\\jj) 

= T (a x i) Jjj dx dy dz y\x, y, z){x, y, z\f) 

= Jff dxdydz y\x + a x , y, z)(x, y, z\f) 

= yf (a x i)\\jf) 


(9.16) 


which indicates that 


[y,T(a x i)] = 0 (9.17) 

since |t^) is arbitrary. Notice in this derivation that it is really adequate to verify 
that the operators commute when acting on an arbitrary position eigenstate |.v, y, z) 
because, as (9.3) shows, we can express any state as a superposition of these position 
states. For (9.17) to be valid for arbitrary a x , 

[y, p x \ = 0 (9.18) 

In fact, the complete set of position-momentum commutation relations can be ex¬ 
pressed in the shorthand form 


&, Pj] = ihSij (9.19) 

where i and j each run over 1, 2, and 3, representing x, y, and z components, 
respectively (fy = x, x 2 = y, and x 3 = z). 

The generators of translations are of course the momentum operators. Since these 
operators commute with each other, we can form three-dimensional momentum 
states that are simultaneously eigenstates of p x , p v . and p z : 


\P X ,Py,P Z ) = lp> ( (9.20) 


where 


PjtlP) = PxlP> PvlP) = fyvlP) Pz\V) = Pz\V) (9-21) 

As with the position states, we normalize the momentum states by 

(p'lp) = <5 (p' x - p x )8(p' y - p y )8(p' z - Pz ) = <r(p - p) (9.22) 
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and therefore 

d 3 p\(pm 2 (9.23) 


is the probability of linding the momentum of a particle in the state |i p) between p 
and p + dp. 

Finally, we can establish that the generalization of (6.42) is given by 


(rlPlV') = — V (r|t/r) (9.24) 

i 

Taking |t//) = ip), a momentum eigenket, we can solve this differential equation [as 
we did (6.51)] to obtain the three-dimensional momentum eigenfunction in position 


space: 


(r|P> = 


1 


JPxx/h 


yPhtK 

1 

(2jr ff) 3 / 2 




e ‘PyylH 


y/lTlh 


JL e iPz z /f> 


JP-r/h 


(9.25) 


which is just the product of three momentum eigenfunctions like (6.54). 


9.2 Translational Invariance and Conservation of 
Linear Momentum 


The Hamiltonian for two bodies with a potential energy of interaction that depends 
on the magnitude of the distance separating the two bodies is given by 

~2 ~ 2 

H = ~~ + ~~ + y (I?, — ? 2 |) (9.26) 

Zni j 2 fti 2 

where pj is the momentum operator for particle 1 and 

P 2 i = Pl + P% + Pl (9.27) 

Similarly, p 2 is the momentum operator for particle 2. It may seem strange to begin 
our discussion with a two-body problem instead of a one-body problem. However, 
any nontrivial Hamiltonian arises from the interaction of one body with at least one 
other body, so we might as well start with the two-body system. By far the most 
important example of a two-body system for which the Hamiltonian is in the form 
of (9.26) is the hydrogen atom, where the potential energy V = —e 2 /|r, — r 2 |. We 
will take advantage of what we learn in this chapter to solve the hydrogen atom, as 
well as some other two-body problems, in Chapter 10. 

For the time being we are presuming it is safe to neglect any spin degrees of 
freedom, so we introduce just the two-body position basis states 

l r i, T 2 > = |ri) j <8> |r 2 ) 2 (9.28) 
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Figure 9.2 Translating both bodies in a two- 
body system by a leaves the distance between 
the bodies unchanged. 


The right-hand side expresses these two-particle states in terms of the direct product 
of single-particle position states, just as in Chapter 5 we expressed the two-particle 
spin states of two spin-j particles as a direct product of single-particle spin states. 
Notice that we can translate the position of particle 1 leaving the position of particle 2 
fixed: 


7j(a)|r,, r 2 ) = e 'P' a Vi- r 2 ) = |r, + a. r 2 ) (9.29a) 


and similarly 


f 2 (a)|r 1 , r 2 ) = e 'P 2 ' a/ V,, r 2 > = |r,, r 2 + a) (9.29b) 

Thus we see that the generators commute: 


[Pi, p 2 ] = 0 (9.30) 

and that the translation operator that translates both of the particles is given by 

f,(a)f 2 (a) - *-'Pr«/» e -''P2-a/ft _ e -n Pi+P2>-a/« __ (9 31) 

where 

P = Pi + P2 (9.32) 

« 

is the total-momentum operator for the system. 

Since translating both of the particles does not affect the distance between them, 
as indicated in Fig. 9.2, we expect that the two-particle translation operator should 
commute with the Hamiltonian (9.26). This is an important result, worth examining 
in detail. As noted in the previous section, it is sufficient to show that the operators 
commute when acting on an arbitrary two-particle position state, because we can 
express any two-particle state |i ft) as a superposition of the two-particle position 
states: 

l*>=/7 d\ d\ 2 |r„ r 2 )<r„ r 2 |^> (9.33) 
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Thus 

7i(a)7 2 (a)V'(|rx — fSDIrj, r 2 ) = Ti(a)7 , 2 (a)V'(liq — r 2 |)|rj, r 2 ) 

= V (|rj — r 2 1)[ri + a, r 2 + a) 

= V(|r, - r 2 |)|r, + a. r 2 + a) 

= V (IH - r 2 l)T 1 (a)7 , 2 (a)|r 1 , r 2 ) (9.34) 

where in the next-to-last step we have taken advantage of the fact that 

f]iT] + a, r 2 + a) — (ij + a) liq + a, r 2 + a) (9.35a) 

i* 2 [ri + a. r 2 + a) = (r 2 + a)|T] + a, r 2 + a) (9.35b) 

and thus 


(r, - r 2 )|r, + a. r 2 + a} = (iq - r 2 )|rj + a. r 2 + a) (9.36) 

Equation (9.34) shows that 

[V(|fj - f 2 |), f,(a)f 2 ( a )] = 0 (9.37) 


From the explicit form of 7’ 1 (a)T 2 (a) in terms of the momentum operators, it is also 
clear that 


Pi p 2 - 

f 1 - + r,(a)r 2 (a) 

2 fTl^ 2/7? 2 


-0 


(9.38) 


and therefore 

[£,7i(a)f 2 (a)] = 0 (9.39) 


Thus from (9.31) we see that the Hamiltonian commutes with the operator that 
generates translations for both of the particles: 

[H, P] = 0 (9.40) 


Recall from (4.16) that 

~ i = kf\lH,PU) (9.41) 

dt h 

Thus the translational invariance of the Hamiltonian guarantees that the total mo¬ 
mentum of the system is conserved. Translational invariance is another illustration of 
the deep connection between symmetries of the Hamiltonian and conservation laws. 
At the end of Chapter 7 we saw that the harmonic oscillator possesses inversion 
symmetry; the parity operator inverts the coordinates and leaves the Hamiltonian 
invariant. Thus the Hamiltonian and the parity operator commute and parity is con¬ 
served. Inversion symmetry is a discrete symmetry. Translation, on the other hand, is 
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a continuous symmetry operation for the two-body Hamiltonian in that the Hamilto¬ 
nian is invariant under translations by an arbitrary distance, leading to conservation 
of linear momentum. 

Notice that if we look at how a particular slate |i/r(0)) evolves with time, 

|^(/))=e-'*''V(0)) (9.42) 

we see that the translated state f (a)l^r(O)) at time t differs from the state \ fU )) by 
just a translation: 

e ~iHt/rij ( a )|^( 0 )) = f (a)e -fA/A |Vr(0)) = f(a)|^(0) (9.43) 


since the translation operator commutes with the Hamiltonian. Thus if you were 
to carry out experiments in a movable laboratory (without windows), you would 
not be able to determine whether the laboratory had been displaced based solely on 
experiments carried out within the laboratory. 

In our analysis in this section, we have used translational invariance to argue that 
momentum is conserved. However, we can also turn the argument around: If mo¬ 
mentum is conserved, the system is translationally invariant because the momentum 
operator is the generator of translations. What would break or destroy this transla¬ 
tional symmetry? From classical physics we know the momentum of the system is 
not conserved if an external force acts on the system. Suppose that in our example of 
the hydrogen atom we insert a third charge q at position r 3 , which interacts with both 
the proton at r, and the electron at r 2 . The Hamiltonian of the three bodies including 
just their Coulomb interactions is then given by 


fj_ Pi t P2 . Ps e 2 [ <ie _ ge 

2m, 2m 2 2m 3 |r, - f 2 | |f 1 — r 3 | |f 2 - ? 3 | 


(9.44) 


We see that translating both the electron and the proton (r, -» r, + a and r 2 -* 
r 2 + a) does not leave the Hamiltonian invariant. Therefore, total momentum of the 
electron-proton system is no longer conserved. However, if we enlarge our definition 
of the system to include all three particles, this three-particle system is invariant un¬ 
der translations of all the particles (r, -» r, + a, r 2 -»■ r 2 + a, and r 3 -» r 3 + a), and 
thus the total momentum of the three-particle system is conserved. This translational 
invariance is not an accident but is built into the law's of electromagnetism, and not 
simply for static Coulomb interactions. In fact, all of the fundamental interactions— 
strong, weak, electromagnetic, and gravitational—seem to respect this translational 
symmetry. Thus, if w'e extend our definition of any system to include all of the bod¬ 
ies and fields that are interacting, we can be sure that the momentum of this system 
is conserved and that any experiment carried out on the system will give the same 
results as those carried out when the system is translated to a different position. This 
latter fact is often expressed by saying that space is homogeneous. Without this ho¬ 
mogeneity w'e w'ould have no confidence in our ability to apply the laws of physics 
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as deduced, for example, from the behavior of hydrogen atoms here on Earth to 
hydrogen atoms radiating in the distant interstellar medium. 


9.3 Relative and Center-of-Mass Coordinates 


The natural coordinates for the two-body problem when the Hamiltonian is of the 
form (9.26) are relative coordinates r and the centcr-of-mass coordinates R, not the 
individual coordinates rj and r 2 of the bodies. The corresponding position operators 
are given by 

r = r i - r 2 

£ _ m i*i + m 2 r 2 
m j + m 2 

Using the commutation relations (9.19) for each of the individual particles and (9.30), 
we see that the total-momentum operator (9.32) satisfies the commutation relations 

\x h Pj\= 0 (9.46) 

which also follows from the invariance of the relative position under total transla¬ 
tions. In addition. 


(9.45a) 

(9.45b) 


[X i ,P J ] = ihS ij (9.47) 

which shows that the total momentum and the position of the center of mass obey 
the usual canonical commutation relations of position and momentum. We also 
introduce the relative momentum operator 


P = 


m 2 Pi — «i]P 2 
m x + m 2 


(9.48) 


which satisfies the canonical commutation relations with r: 


[x i ,Pj] = ihS i j (9.49) 

as well as the commutation lelation 

[Xi,pj] = 0 (9.50) 

with R. Commutation relations (9.46) and (9.50) show that the relative and center- 
of-mass operators all commute with each other. 

We will use the states |r, R) instead of |r l5 r 2 ) as a basis for our discussion of the 
two-body problem. The reason for this choice becomes apparent when we express the 
two-body Hamiltonian (9.26) in terms of the relative and center-oi-mass operators. 
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We find 


W = —+ ^ + V(|r|) 
2 M 2 n 


(9.51) 


where 


M — m i + in 2 


(9.52a) 


is the total mass of the system and 

m.mi 

M =-— 

m, + m 2 


(9.52b) 


is the reduced mass. See Problem 9.5. The Hamiltonian (9.51) is the sum of the 
kinetic energy of the center of mass 



(9.53) 


and the energy of the relative motion of the two particles 


aT 

Wrel = 7 1 + V(|f|) (9.54) 

2 AT 

Since these two operators commute with each other, they have eigenstates | £ cm , £ rel ) 
in common: 

H frd) = (Am + H rcl )|£ cm , £ rel ) = (£ em + E rel )|£ cm , £ rel ) (9.55) 


and hence the energy eigenvalue of the two-body Hamiltonian is £ = £ cm + £ re |. 

The eigenstates of H cm are just those of the total-momentum operator P. In 
position space, the total momentum eigenfunctions are given by 

(R|P> = ^^‘ R/fi (9 ‘ 56) 

as in (9.25) except that here the momentum P is the momentum of the center of 
mass and the position variable R is the position of the center of mass. It is common 
to analyze the two-body problem in the center-of-mass frame, where P = 0 and 
therefore £ = £ rcl , since then the kinetic energy of the center of mass vanishes. 
Thus from now on we will concentrate our attention on just the Hamiltonian 

- 2 

H = 37 1 + V(|?|) (9.57) 

2m 

This Hamiltonian is the same as that for a single body in the central potential V(r), 
provided the mass of the body is taken to be the reduced mass of the two-body system. 
This is the familiar result from classical mechanics, but here expressed in terms of 
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operators. Thus in analyzing the Hamiltonian (9.57), we are analyzing a single body 
in a central potential as well as two bodies interacting through a potential energy 
that depends on the magnitude of the distance between them. 

9.4 Estimating Ground-State Energies Using the 
Uncertainty Principle 


Much of the remaining discussion in this chapter on orbital angular momentum will 
cover material that we will use in Chapter 10 in the determination of the energy 
eigenstates and eigenvalues of the Hamiltonian (9.57) for a number of specific central 
potentials. For now, it is useful to be able to estimate the energy scale for systems 
like the hydrogen atom without actually solving the energy eigenvalue equation. 
The Hamiltonian for the hydrogen atom, including only the predominant Coulomb 
interaction between the particles, is given by 


fr = £- e - 


2/i 


(9.58) 


with the reduced mass being that of the electron-proton system. 3 

The expectation value of the Hamiltonian (9.58) in the ground state is given by 


2n r 


(9.59) 


We denote this energy by £j because, as we will find in Chapter 10, this state is 
labeled by the principal quantum number n = 1. Using dimensional analysis, we 
can express 


e~ 

a 


(9.60) 


where a is a length, char acteristic of the size of the atom, that we will now estimate. 
But if the atom has a finite size, the uncertainty in the relative position of the two 
particles is also at most on the order of a. A finite position uncertainty means there 

0 

must be a finite momentum uncertainty as well. From the Heisenberg uncertainty 
relation, we expect that 

|Ap|£- (9.61) 

a 

Note that we have not actually calculated the position uncertainty and thus the value 
we are taking for the momentum uncertainty is a rough estimate. 


•’ In SI units, the potential energy is — e^/Axe^r. If you want to work in SI units, just consider 
<r a shorthand notation for e 2 /AireQ. 
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The expectation value of the kinetic energy is 


(P 2 ) _ Ap : + (p) 2 _ Ajr 
2/u 2/u 2jr 


(9.62) 


where in the last step we have taken (p) = 0, since {p> is independent of time in a 
stationary state—the ground state is, of course, an energy eigenstate—and if (p) ^ 0, 
the system would not stay within a particular region of space. Our estimate of the 
total energy is thus given by 




h 2 

2 iia- 


e 2 

a 


(9.63) 


Decreasing the value for a decreases the potential energy. However, it also increases 
the kinetic energy. Clearly, there is an optimum value for a that minimizes the 
energy. 4 Setting dE^/da = 0. we lind 

h 2 

a = —- (9.64) 

m e e l 


and hence 


4 

_ m e e* 

E\ ~- e — 

2 h 2 


(9.65) 


where we have replaced the reduced mass by the electron mass since the two differ 
by only 1 part in 2000. If we put in numerical values for the mass and charge, we find 
that a is on the order of angstroms and the energy is on the order of 10 e V. Although 
our estimates are strictly only order-of-magnitude estimates and we should be lucky 
to be within a factor of two of the exact ground-state energy, we have judiciously 
chosen (9.61) so that (9.65) turns out to be the exact value (13.6 eV) that we will 
find in Chapter 10. 

The important thing to note at this point is how quantum mechanics has saved 
the atom from collapse. In classical physics, one could always lower the energy of 
the system by putting the proton and the electron closer together. In fact, before the 
discovery of quantum mechanics the stability of atoms was a puzzle. But we now see 
that making the atom smaller increases the kinetic energy of the system so that there 
is a natural resistance to compressing the atom into too small a region. Quantum 
mechanics with its own fundamental constant, Planck’s constant, has set the natural 
length scale (9.64) for atomic physics, as well as the natural energy scale (9.65). 


9.5 Rotational Invariance and Conservation of 
Angular Momentum 


Let’s continue our general analysis of the Hamiltonian (9.57). One of the first 
things we notice about this Hamiltonian is that it possesses rotational symmetry. 


4 Also see the discussion in Section 12.2 on the variational method. 
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since p 2 = p • p = p 2 + p 2 + p 2 and r = |r | = (r • r) '/ 2 = (x 2 + >’ 2 + z 2 ) 1/2 are both 
invariant under rotations; they involve the length of a vector and the length of a vector 
doesn’t change when it is rotated. The Hamiltonian, however, involves operators, not 
just ordinary numbers, and it is instructive to verify explicitly that it is rotationally 
invariant. 

Let’s consider the operator R(dcpk ), which rotates a position state counterclock¬ 
wise about the z axis by an angle d(j). Notice that there is nothing in the Hamiltonian 
that picks out a specific direction in space, so it is completely arbitrary which direc¬ 
tion we choose to call the z direction. Using (3.2), which shows how an arbitrary 
vector changes when rotated about the z axis, we see that for an infinitesimal rotation 


R(dcpk)\x, y, z) — \x — y dcp, y + x dcp, z) (9.66) 

•> 

We express the rotation operator R(dcpk) in terms of the generator of rotations as 

R(dcpk) = 1 — —L 2 dcp (9.67) 

h 


where we have called the generator L z instead of J z , as in Chapter 3, because, as we 
will now see, this generator is the orbital angular momentum. Taking advantage 
of (9.8), (9.10), and the form of the translation operator (9.12) when the translation 
is infinitesimal, we can write through first order in the infinitesimal angle dcp 


\x - y dcp , y + x dcp, z) = 


- jPx(-y d< $>) 
h 


1 - jPy(x dcp) 

h 


1 - -r(xp y - yp x ) dcp 
n 


\x, y, z) 


\x, y, z) 
(9.68) 


Thus the generator of rotations about the z axis in position space is simply 


L z = xp y - yp x (9.69) 

which is the z component of the orbital angular momentum operator L = rxp. We 
finally see orbital angular momentum entering as the generator of rotations because 
we have turned our attention to rotations that move position states around. 

One way to confirm that the Hamiltonian is rotationally invariant is to check 
that it commutes with the generator of rotations. Using the position-momentum 
commutation relations (9.49) and the form (9.69) for L z , we find that 

[t, Pxi = [xp y - yp x , p x ] = [Xpy, Px) 

= lx, p x ]p y = ihpy (9.70a) 

[L z , Py\ = L xpy - yp x , Py] = ~[yp x , Py] 

= “ty. Py\P x = ~ihPx (9.70b) 

[T z , p z ] = [xp y , - yp x , p z ] = 0 (9.70c) 
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and therefore 

[L z , p 2 \ = [L zt p 2 x + p 2 y + p 2 z ] 

= Px\l z , Px 1 + [£;• Px\Px + PylL z , p y ] + l L-, p y ]p y 


— 2 ifiPxPy — 2 itiPxPy = 0 

(9.71) 

Similarly, we can establish 


[L z ,x] = My 

(9.72a) 

IL Z , y] = -ihx 

(9.72b) 

[L z ,z] = 0 

(9.72c) 

and therefore 


[L z , x- + y 2 + z 2 ) = 0 

(9.73) 


as well.- Thus L. commutes with a potential energy that is a function of the magni¬ 
tude of the radius vector. 

There is another instructive way to establish that V(|r|) is invariant under ro¬ 
tations. We express a particular point in position space in terms of the spherical 
coordinates shown in Fig. 9.3: 

I*, y, z) = |r, 9, <p) (9.74) 

The advantage of these coordinates is that the action of the operator R(dcp k) on these 
states is transparent. Namely, 

R(d<pk)\r, 9 , 0) = |r. 9, <p + d<t>) (9.75) 


5 Notice the similarity in how r and p transform under rotations. In fact, the commutation 
relations (9.80) of the orbital angular momentum operators can be cast in a similar form as well: 

If.,, L x ] = itiL y \L._, L y l = -ihL x [L. £.,] = 0 

All vector operators must behave the same way when they are rotated. See Problem 9.6. 
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is clearly a rotation ol' the state by angle dcp about the z axis. Thus 

R(d<t>k)V (|r|)|r, 9. <P) = R(d<pk)V(r)\r, (9, 0) 

= V (r)|r. 6, <p + d(p ) 

= V (|f|)/?(r/0k)|r, 9, <f>) (9.76) 

Since the two operators commute when acting on an arbitrary position state, the two 
operators commute in general. 

We have established that the Hamiltonian (9.57) commutes with the generator ol' 
rotations about the z axis: 


[H,L 2 ] = 0 (9.77) 

What we have chosen to call the z axis could just as easily be called the .v or the 
y axis by someone else. Therefore it must be true that 

[fl,LJ = 0 (9.78) 

and 

[H, L y 1 = 0 (9.79) 

The system is invariant under rotations about any axis. Since the Hamiltonian com¬ 
mutes with the angular momentum operators, angular momentum is conserved. 
Rotational symmetry has led to a conservation law—conservation of angular mo¬ 
mentum! Moreover, since the Hamiltonian commutes with the rotation operator, 
rotating a system does not affect the time evolution of the system, provided every¬ 
thing interacting with that system is rotated as well. We say this rotational invariance 
is a reflection of the isotropy of space, just as translational invariance is a reflection 
of its homogeneity. 

9.6 A Complete Set of Commuting Observables 


Our goal in this section is to take advantage of what we have learned in the preceding 
section about the symmetry of the Hamiltonian (9.57) to specify the energy eigen¬ 
states as well as write out the energy eigenvalue equation in position space. We will 
see how to reduce the three-dimensional Schrodinger to a one-dimensional equation 
for the radial part of the wave function. This equation is one that we will solve in 
Chapter 10 for a variety of central potentials V (r), including the hydrogen atom, the 
free particle, and the isotropic harmonic oscillator. 

First, let us note that since L x , L x . and L. are the generators of rotations, they 
must satisfy the general commutation relations 

l L x , L y \ = iht z [L y , L z 1 = ihL x [L z , L x ] = ihL y (9.80) 
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as discussed in Chapter 3. A useful way to express these commutation relations, as 
well as the commutation relations of any angular momentum operators, is in terms 
of the completely antisymmetric tensor E t j k . Complete antisymmetry means that the 
tensor changes sign if any two of the indices are interchanged: c ;/i . = —E ]lk and so on. 
The other defining relation is that £123 — 1- The complete antisymmetry determines 
all of the other components. For example, £ 132 = —£123 = — 1 and if any two of 
the indices are the same, the tensor vanishes (e ( i2 = —£112 — 0)- The commutation 
relations (9.80) may now be put in the more compact form 

3 

(9-81) 

*=1 

where j, and k lake on the values from 1 to 3 and L), L 2 , and L 3 stand for L x , L y , 
and L z , respectively. In fact, using we can express the ordinary cross product 
in component form as 

3 3 

= <9.82) 

7=1 *=1 

Since the generators of rotations about different axes do not commute with 
each other, we cannot choose more than one of them to simultaneously label the 
eigenstates of the Hamiltonian. However, since each of the generators as well as the 
Hamiltonian commutes with 

L 2 = L] + L\ + l\ (9.83) 

A * A 

we can form simultaneous eigenstates of H, L , and one ot the components of 
the angular momentum, which is generally taken to be L z . We then label these 
eigenstates by |E, /, m), where 


H\E, l, m) = E\E, /, m) 

(9.84a) 

L 2 \E, l, m) =1(1 + l)h 2 \E, l, m) 

(9.84b) 

L,\E , /, m) = mh\E, /, m) 

(9.84c) 


The Hamiltonian (9.57) actually involves the operator L 2 , although it is hidden 
away as part of the rotational kinetic energy. To see this, we first use the identity (see 
Problem 9.7) 

L 2 = rxprxp = r 2 p 2 — (r ■ p) 2 -I- i fir • p (9.85) 

Since we wish to express the energy eigenvalue equation in position space, we now 
evaluate 


(r|L 2 |v>> = (r|[r 2 p 2 - (r • p) 2 + ihr • pj|^> (9.86) 
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Note that 


<r|r 2 p 2 |i/r) =r 2 (r|p 2 |Vr) 


(9.87) 


and 


Thus 


(r|r • p|^r) = r • -V(r| \j/) = r\\/f) 

i i dr 


(r|(r-p) 2 | f) = -n 2 r^~ 

dr 


dr ) 


W) 


Combining these results, we see that 


fi 2 ( a 2 2 a 


— <r|p-| f) = —r + (r\yfr) + 


(r\V-W) 

2 fir 2 


2fx'"' 2fi \9r 2 r dr 

and thus the position-space energy eigenvalue equation is given by 


(r\^W) + (r\V(\rm) 

2fx 

n 2 ( a 2 2 3 \, ... 

-~2^W + ~rTr) m)+ 2»r* 


<r|L 2 |Vr> 


(9.88) 


(9.89) 


(9.90) 


+ V(r)(r\f) = E(r\r/f) (9.91) 


The kinetic energy (9.90) has two parts. One of the parts is easily recognizable 
as the rotational kinetic energy L 2 /27, with a moment of inertia 7 — fir 2 —just the 
moment of inertia that you would expect for a mass /i rotating a distance r from a 
center of force. The other part of the kinetic energy must be the radial part. We can 
express this part in a form familiar from classical mechanics if we define the radial 
component of the momentum operator 


or 


< r l Pr\t) = T 

l 



<r|.*> 


(9.92a) 



(9.92b) 


in position space. 6 Expressed in terms of this operator, the radial part of the kinetic 
energy becomes 


2_9_\ 

2 n \9r 2 r dr) 


(W) 


(rl p 2 r \f) 

2 fi 


(9.93) 


6 The form for p r in position space may seem a little strange. See Problem 10.1. 
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If we choose the state \rfr) to be a simultaneous eigenstate H. L 2 , and L : , that is, 
|\!/) — | E, /, m), (9.91) becomes 


. 2/t 


(<P_ 2_a_\ 

\3r 2 r dr) 


+ 


/(/ + \)fi 2 
2 Mr 2 


+ V(r) 


(r\E,l,m) = E(r\E,l,m) (9.94) 


Note that the expression in brackets in the left-hand side of this equation depends 
only on r, not on the angles 9 and <p. If we express the wave function in the form 


(r| E,l,m) = R(.r)Q(6)<i>(<p) (9.95) 


we obtain the radial differential equation 


2 m 


( 


dr 2 d \ 

— 7 5 - T~ I 5 " 

dr r dr J 


W + l)/r 

2 fir 2 


+ V{r) 


R(r) = ER(r ) 


(9.96) 


where we have divided out the angular part of the wave function. 

Equation (9.96) is a very important and useful result. We have succeeded in 
reducing the three-dimensional Schrodinger for an arbitrary central potential V (r) 
to a one-dimensional radial equation. We will devote Chapter 10 to solving this 
equation for a number of specific central potentials. For now, we note that if we 
introduce the function u(r) through 


/?(/-) = 


u(r) 


(9.97) 


the radial equation simplifies to 

H 2 d 2 | HI + 1 )fr 


V(r) 


u(r ) = Eu(r) 


(9.98) 


L 2m dr 2 2[ir 2 

This radial equation has the same form as the one-dimensional Schrodinger equation 


h 2 d 2 

~27n77 2 + Vix) 


(x\E) = E{x\E) 


(9.99) 


but with an effective potential energy 

VWM = l1 ^- + v(r) (9.100) 

2/xr- 

This means that you can cam' over any techniques, numerical or otherwise, that 
you know from solving the one-dimensional Schrodinger equation to help solve the 
radial equation. 

Note that the lack of dependence of the energy eigenvalue equation (9.96) [or 
(9.98)] on the eigenvalue m of L z is a direct manifestation of the rotational invariance 
of the Hamiltonian. In essence, there is no preferred axis picked out in space, such 
as w'ould be the case if an external magnetic field were applied to the atom in, for 
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example, the z direction, in which case the energy would indeed depend on the 
projection of the angular momentum along this axis. Nonetheless, we still need the 
m value in (9.84) in order to specify each state uniquely—there are. after all, 21 + 1 
different in states for each value of /. The set of operators that commute with each 
other that are necessary to label each state uniquely is referred to as a complete set of 
commuting observables. For a given system, there may exist several complete sets 
of commuting observables. For example, for the Hamiltonian (9.57) w e could use H, 
L 2 , and L x to label the states instead of H , L 2 , and L : . However, for a real hydrogen 
atom neither of these sets of operators is complete, since it is necessary to specify 
both the electron and proton’s intrinsic spin state in order to label the states uniquely. 
Assuming that the Hamiltonian is one of our complete set of commuting observables, 
we need to know how the spin operators Sj of the electron and S 2 of the proton enter 
into this Hamiltonian before we can determine the other members of the complete set 
of operators that commute with H. For example, the form of the hyperfine spin-spin 
interaction of the electron and proton given in (5.9) shows that neither S\, nor S 2z 
commutes with the Hamiltonian because they do not commute with S) ■ S 2 . In this 
case we would choose the total spin operators S' = (S ( -I- Si)' and S. = S t . + S 2z 
as well as H, L 2 , and L.. As we will see in Chapter 11, the Hamiltonian for the 
real hydrogen atom is even more complex, involving the coupling of the spins of the 
particles to the relative orbital angular momentum. 


EXAMPLE 9.1 Instead of spherical symmetry, consider a system that has 
cylindrical symmetry about the z axis. Which operators form a complete set 
of commuting observables for this system? 

SOLUTION For a system with cylindrical symmetry about the z axis, there 
are two symmetries: rotational symmetry about the z axis and translational 
symmetry along the z axis. Since L. is the generator of rotations about the z 
axis and p. is the generator of translations along the z axis, the Hamiltonian 
H, L-, and p z all commute with each other and form a complete set of 
commuting observables for this system. 


9.7 Vibrations and Rotations of a Diatomic Molecule 


An interesting two-body system in which, to a first approximation, the radial motion 
of the particles decouples from the angular motion is formed by the nuclei of a 
diatomic molecule, such as HC1. A schematic diagram of the potential energy V(r) 
of such a molecule is shown in Fig. 9.4. At large distances the atoms in the molecule 
attract each other through van der Waals forces, while at short distances, when the 
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Figure 9.4 A schematic diagram 
of the potential energy V(r) of a 
diatomic molecule. 


electrons in the atoms overlap, there is a strong repulsion." In between there is a 
minimum in the potential energy. As we argued in Chapter 7, in the vicinity of the 
potential energy minimum at r 0 , the system behaves like a harmonic oscillator and 
we can write 


v<r)-vw + i(£ + 

I 7 7 

= v (r 0 ) + - r 0 r H- 


(9.101) 


where /x. the reduced mass of the two nuclei, is on the order of M N , the nuclear 
mass. 

In general, the potential energy of the molecule is on the order of e 2 /a, where 
the size a is roughly the same as for atomic systems, namely, a = fr/m e e 2 . Since 
the size scale is set by a, then by dimensional analysis 


d 2 V e 2 

- 'V - 

dr 2 a 3 


The spacing between vibrational energy levels is thus given by 


tus) - h 




(9.102) 


(9.103) 


The second factor in parentheses is the electronic energy scale given in (9.65). Since 
the factor (m e /M N )^ 2 is on the order of 1/40 for a diatomic molecule such as HC1, 
the wavelength of photons emitted or absorbed when the system changes from one 
vibrational energy level to an adjacent one is roughly 40 times longer than for a 


7 Because of their small mass, the electrons in the molecule move rapidly in comparison with 
the nuclei and thus readjust their positions very quickly when the nuclear positions change. 


Page 339 (metric system) 




9.7 Vibrations and Rotations of a Diatomic Molecule | 323 


Axis of rotation 



Figure 9.5 A classical model of a 
diatomic molecule rotating about its 
center of mass. 


typical atomic transition and is thus in the infrared portion of the electromagnetic 
spectrum. The purely vibrational energies are given by 8 

E n v = («v + ;) n v — O' 1< 2*... (9.104) 


Examination of the energy eigenfunctions of the harmonic oscillator shows that 
for states of low excitation the diatomic molecule vibrates over a distance scale 
given by 



Thus the amplitude of vibration (in states of low vibrational excitation) is only a 
small fraction of the equilibrium separation r 0 of the nuclei in the molecule. For this 
reason, we can say that the molecule is fairly rigid, and we can treat the rotational 
motion separately from the vibrational motion. 

In a particular state of vibration, the molecule is still free to rotate about its center 
of mass, forming a rigid rotator (see Fig. 9.5). The Hamiltonian for such a rotator is 
given by 


- L 2 

// = — (9.106) 

where the moment of inertia I = /rr 2 . This is exactly the form of the rotational part 
of the kinetic energy operator in (9.91), with the value of the radius replaced by 
the equilibrium separation. The eigenstates of this Hamiltonian are just the angular 
momentum eigenstates: 

f2 I/I , r\fc2 

— 1 l,m)= 2{ — \l,m) = E,\l,m) (9.107) 


8 We will call the vibrational quantum number n v instead of n, as was done in Chapter 7, 
because for atoms and molecules the principal electronic quantum number is generally referred to 
as «. See Section 10.2. 
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Figure 9.6 (a) An energy-level diagram of a three-dimensional rigid rotator, (b) Transi¬ 
tions between adjacent energy levels generate the rotational spectrum. 


An energy-level diagram is shown in Fig. 9.6. The spacing between adjacent energy 
levels is given by 


Ei — £)_ i — 


id + m 2 (/ - 1 m 2 


(9.108) 


Notice how this energy spacing increases with increasing I, in contrast to the constant 
spacing between levels characteristic of the harmonic oscillator. The magnitude of 
this energy spacing is on the order of 


Itl 2 j fir _ i m e f m e e 

M N a 2 M, v V H 2 


(9.109) 


The predominant electric dipole transitions obey the selection rule Al — ±1, as 
we will see in Chapter 14. Thus, comparing (9.103) with (9.109), we see that the 
wavelength of photons emitted or absorbed in transitions between adjacent rotational 
energy levels of low / is a factor of ( M N /m e ) l/2 longer than that for the vibrational 
transitions. Purely rotational transitions reside in the very far infrared or short 
microwave portion of the electromagnetic spectrum. The energy spacing between 
levels is on the order of 10 -2 -10 -3 eV. Since k#T at room temperature is 1/40 eV, 
many of these levels will be excited at this temperature. 
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Figure 9.7 The absorption spectrum of HC1. Adapted from D. Bloor et al., Proc. Roy. 
Soc.. A260. 510 (1961). 


Figure 9.7 shows the purely rotational absorption spectrum of HC1. Notice that 
the values of I are all integral. Setting E l — E\_\ — hv = he/X, we see that 

XI = 2nlc/h = 2njLr^c/h (9.110) 


The observed values of XI arc given in Table 9.1. Note that the constant value of this 
parameter is consistent with our treatment of the molecule as a rigid rotator. 9 From 
these values we can deduce that for HC1 the internuclear distance r () = 1.27 A. This is 
an example of how we can use the information contained in the rotational spectrum 
to learn about the structure of molecules. 

In practice, it is difficult to produce in the far infrared or short microwave region 
the continuous spectrum of radiation that is required for observations in absorption 
of purely rotational transitions, like those shown in Fig. 9.7. However, the combined 
vibrational and rotational energies of a diatomic molecule are given by 



fico + 


/(/ + 1 )tr 
21 


(9.111) 


Figure 9.8a shows an energy-level diagram. If the molecule, like HC1, possesses a 
permanent dipole moment, there is a vibrational selection rule A n v — ± 1 for electric 
dipole transitions. 10 In addition, satisfying the rotational selection rule Al — ±1 


9 For many molecules the separation distance is observed to increase slightly for increasing 
values of /, as you would expect in a centrifuge. It is often possible to observe 40 to 50 rotational 
energy levels between each vibrational level. 

10 We will see how such selection rules arise in Chapter 14. In particular, sec Section 14.5 for 
an example involving electromagnetic transitions between states of the harmonic oscillator. 
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Table 9,1 Rotational absorption transitions in HC1 


Transition 
/- !-*>/ 

X 

(microns) 

v — c/X 
(10 9 Hz) 

v/l 

(10 9 Hz) 

XI 

(cm) 

AV 

(eV) 

(0-» 1)“ 

(479) 

(626) 

(626) 

(0.0479) 

(0.0026) 

1 —> 2 

243 

1235 

618 

0.0486 

0.0051 

2-4-3 

162 

1852 

617 

0.0486 

0.0077 

3^4 

121 

2479 

620 

0.0484 

0.0103 

4^5 

96 

3125 

625 

0.0480 

0.0129 


" This transition is not shown in Figure 9.7. 

Source: A. P. French and E. F. Taylor, An Introduction to Quantum Physics, 

W. W. Norton, New York, 1978, p. 492. 

leads to the set of allowed vibration-rotation frequencies shown in Fig. 9.8b. These 
frequencies are in the easily accessible infrared part of the spectrum (see Fig. 9.9). 

In concluding this discussion of diatomic molecules, we should note that the 
small energy spacing between the rotational levels makes diatomic molecules inter¬ 
esting low-temperature thermometers. In 1941 A. McKellar observed absorption ol 
light coming from the star f Ophiuchi by an interstellar cloud containing cyanogen 
radicals. 11 In the CN molecule there is a transition at 3874 A from the ground elec¬ 
tronic configuration to an excited electronic configuration. Just as with vibrational 
transitions, this change in electronic states may be accompanied by a change in the 
rotational level of the molecule as well. For CN the / = 0 ground state and / = 1 slate 
are separated in energy by £ /=1 — £ /=0 = hej'k with the wavelength X — 2.64 mm 
McKellar’s observations of the relative strengths of two absorption lines, one from 
the / — 0 state and the other from the / = 1 state, allowed him to deduce a population 
for the / = 1 rotational level that corresponded to the molecule being in a thermal batl 
at temperature T — 2.3 K, if no other special excitation mechanism was present. The 
significance of McKellar’s observation was not appreciated until after 1965, wher 
Penzias and Wilson, using a radio telescope, observed the cosmic background ra¬ 
diation resulting from the initial Big Bang at X = 7.35 mm. The currently acceptec 
temperature for this background radiation is 2.7 K. Subsequent reexamination of the 
CN absoiption spectrum confirmed that no special mechanism for exciting the / = j 
state seemed to be in action and that the background temperature at 2.64 mm was 
consistent with that of the cosmic background radiation. Before high-altitude balloor 
flights took place in the 1970s, observations of the populations of diatomic molecules 
such as CN and CH provided the only information about the blackbody spectrum a 
wavelengths shorter than 3 mm, because the earth’s atmosphere is strongly absorbing 
in this portion of the spectrum. 


11 A. McKellar, Pubis. Dominion Astrophys. Observatory (Victoria, B.C.) 7, 251 (1941). 
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Figure 9.8 (a) Vibrational-rotational transitions for a diatomic molecule, (b) A schematic- 
diagram of the resulting spectrum. 



Figure 9.9 A vibrational-rotational absorption spectrum of HC1. Each peak is double 
because of the presence of two isotopes of chlorine in the gas—' 5 C1 and the less abundant 
' 7 C1. Data are from M. Liu and W. Sly, Harvey Mudd College. 
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9.8 Position-Space Representations of L in 
Spherical Coordinates 


Since the orbital angular momentum operators generate rotations in position space 
we can determine position-space representations of these operators. Given the fac 
that we are dealing with orbital angular momentum, it is probably not surprising 
that the most convenient position-space states are expressed in spherical coordinates 
where the angles appear explicitly. Note that the bra equation corresponding to the 
ket equation (9.75) is 

(r, 0, tp\R*(dtpk) = (r, 9,tp + dtp\ (9.112 

Since the rotation operators are unitary, RR~ = 1, R^ is the inverse of R, and therefore 

{r,0,tp\R(dtpk) = (r,9,tp-dtp\ (9.113 


Thus 


(r, 9, tp\R(dtpk)\\p) = (r, 9, tp - d<f>\f) 


= (r, 9, tp\f) - dtp 


Hr,e,w) 


d<p 


(9.114 


where the last step comes from expanding the wave function (r, 6, <p — dtp\tp) in ; 
Taylor series. Since 


(r, 9, tp\R(dtpk)\i!/) = (r, 9, <p\ 


('-H 




(9.115 


we see that 


(r, 9, <p\L z \tfr) = 9, cp\f) (9.116 

/ dtp 

Thus the z component of the orbital angular momentum operator is represented ii 
position space by 


L 


Z 


HJL 

i dtp 


(9.117 


which should be compared with the representation of the linear momentum operator 


Px 


hd_ 
i dx 


(9.118 


The important thing to note is that the orbital angular momentum is represented b; 
a differential operator in position space. This has profound consequences. To help se< 
why, let’s return to (9.116) and consider the special case where |i jr) — \l, m), namely 
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the slate is an eigenstate of orbital angular momentum. Since L,\l. m) — mh\l. m), 
we have 

h 

(r, 0, 0|L.|/, m) = - — (r, 0, 0| l, m) 
i <10 

= mh(r,e,<t>\l,m) (9.119) 

Solving this differential equation, we find that the 0 dependence of an eigenfunction 
is of the form e'""*’. We must require that the eigenfunctions be single-valued: 

— e im((p+2x) (q | 20) 

Otherwise, how would we determine the derivative of a wave function at angle 0 
when the wave function approaches different values depending on the direction from 
which we approach 0? Also, recall from (2.78) that for L, to be Hermitian, it must 
satisfy 

WL Z \X) = (x\L\rP)* (9.121) 

which in position space becomes 12 

0 _T r “I * 

r%0 0*(0)-^-x(0)= r* d<t>x*(<t>)-4i'i'w ( 9 - i22 > 

JO I 00 JO I 0(p 

As integration by parts shows, for (9.122) to hold for all 0(0), the wave functions 
must satisfy 0(0) = 0(2^). Note that if L, were not Hermitian, the rotation operator 
/?(0k) would not be unitary, and probability would not be conserved upon rotation 
of a state. 

This single-valuedness requirement (9.120) is satisfied only if 

m=0,±l,±2,... (9.123) 

But from our general analysis of angular momentum in Chapter 3, we know that the 
m value for a state with a particular / value runs from / to —l in integral steps. Thus 
the values of / must also be integral: 

/ = 0,1,2.... (9.124) 

as we also saw in the spectrum of the diatomic molecule. The fact that orbital angular 
momentum has representations in position space has restricted the possible values 
of the angular momentum quantum number / to integer values. In particular, no 
half-integral values are permitted, unlike intrinsic spin angular momentum. 

12 We ignore all but the 0 dependence for notational simplicity. 
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The position-space representations in spherical coordinates of the other compo¬ 
nents of the angular momentum are not as simple as that for L.\ only for rotations 
about the z axis is there a single angle that can be used to express the rotation 
as in (9.113). There are a number of techniques that can be used to determine 
the position-space representations for L x and L y . One approach is to express the 
position-space representation 

*■ 

L = rxp—>rx- V (9.125) 

i 


in Cartesian coordinates and then make a change of variables to spherical coordi¬ 
nates. However, it is a little easier to express the gradient in spherical coordinates 
directly: 


L 


h ( a 19 

ru i X T U,— -Ml*- — +U 0 
i \ dr r 66 

n ( d 1 


r sin 0 30 / 


(9.126) 


Taking the x and y components of the unit vectors u# and u„>, we obtain 


? ft / . , 9 „ 3\ 

L x —► T I — sin 0— — cot 9 cos 0-— 1 
i \ 99 90 / 

h ( a a \ 

L y -> — ( cos0-cot# sin0— I 

i \ 9 9 90/ 


(9.127) 

(9.128) 


We can then combine the results (9.117), (9.127), and (9.128) to obtain the repre¬ 
sentation of L 2 . 


L 2 


- ti 2 


i a /. a\ i a 2 ‘ 

—-S1H0- -I-r-- 

_sin#90 V 99/ sin 2 0 30-. 


(9.129) 


This form for L : may re min d you of the angular part of the Laplacian in spherical 
coordinates. In fact, if we express the energy eigenvalue equation in position space 
in terms of the Laplacian: 


(f|t— + V' (|r|)|0) = 
2 fi 


ft 2 .2 


r) 


V 2 + V(r) (r|0) = £<r|^> (9.130) 

L 2/r 


and then express the Laplacian in spherical coordinates, we obtain 


/ h 2 a 2 2 a i r i a / . 1 a 2 ' 

(-—- -t-1——-I sm 9 — ) -+■ —t-- 

\ 2 n dr- r dr r 1 [sin 9 39 \ 99/ sin 2 9 9(f>-_ 


) 


+ V(r) (r|0) 


= £(r|0) 


(9.131) 


This equation agrees with (9.91). provided we make the identification (9.129). 
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EXAMPLE 9.2 It is often said that the electron’s intrinsic spin angular 
momentum arises from the electron spinning about an axis, much as the 
Earth has spin angular momentum about an axis running through the North 
Pole. This spin angular momentum for the Earth is in addition to the orbital 
angular momentum the Earth possesses as it revolves about the Sun. Explain 
why a similar situation does not apply to an electron in, say, the hydrogen 
atom, where the electron has both spin and orbital angular momentum. More 
specifically, why can we be assured that at some point in the future we will 
not determine a physical origin for the electron’s intrinsic spin in which the 
electron’s spin angular momentum arises from the electron literally spinning 
about some axis? 

SOLUTION If the electron were spinning about an axis, say the z axis, the 
angular momentum would be orbital angular momentum, no matter what 
name we choose to give to it. In this section we have seen that the eigenvalues 
of L z , the generator of these rotations, take on solely integral multiples of 
h. For the electron, S z — ±71/2, namely a half-integral multiple of h. Thus 
the spin angular momentum of the electron is not simply orbital angular 
momentum with a different name. It is, of course, real angular momentum 
nonetheless. 


9.9 Orbital Angular Momentum Eigenfunctions 


One of the most evident features of the position-space representations (9.117), 
(9.127), and (9.128) of the angular momentum operators is that they depend only 
on the angles 9 and <p, not at all on the magnitude r of the position vector. Rotating 
a position eigenstate changes its direction but not its length. Thus we can isolate 
the angular dependence and determine {9, <p\l, m) , the amplitude for a state of 
definite angular momentum to be at the angles 9 and <j>. These amplitudes, which 
are functions of the angles, are called the spherical harmonics and denoted by 

{9, <p\i, m) = Y Lm (9, 4>) (9.132) 

Expressed in terms of these amplitudes, the energy eigenfunctions of the Hamilto¬ 
nian (9.57) are given by 

(r, 9 , <P\E, l, m) = R{r)Y l m {9, </>) (9.133) 

We might have been led to an expression such as (9.133) for the eigenfunctions 
by solving the partial differential equation (9.131) by separation of variables. Here, 


Page 348 (metric system) 





332 | 9. Translational and Rotational Symmetry in the Two-Body Problem 


however, we have been guided by the rotational symmetry of the problem to write 
the angular part of the eigenfunction ©(<9)4>(0) = Y im (9, <fi) directly. 

Let’s address the question of how we should normalize these eigenfunctions. 
Clearly, we want 

j d\ |{r, 9, <P\E, /, m}| 2 = f d\ \R(r)\ 2 \Y^ m (9, 0)[ 2 = 1 (9.134) 

since the probability of finding the particle somewhere in position space should sum 
to one. 13 The differential volume element c/V in spherical coordinates is given by 

c/Y = r 2 dr sin 9 dO d(p — r 2 dr dQ (9.135) 


where the solid angle dQ. = sin 9 d9 d(j> is the angle subtended by the differential 
surface area dS shown in Fig. 9.10a. Notice that the definition of solid angle is 
in direct analogy with the definition of ordinary angles in radian measure as the 
angle subtended by the differential arc length shown in Fig. 9.1 Ob. The solid angle 
subtended by a sphere (or any closed surface) is 


[ dQ= f 

/all directions JO 



sin 9 d9 — An 


Since we want 


(9.136) 



r 2 dr |tf(r)| 2 



sin 9 d9 



d<P |T/ „,(0, 0)| 2 = 1 


(9.137) 


we can choose to normalize separately the radial and the angular parts of the 
eigenfunction: 



r 2 dr |/£(r)| 2 = 1 


(9.138) 


so that the total probability of finding the particle between r — 0 and r — oc is one, 
and 



sin 9 d9 



d(t>\Y Lm (9,<P)\ 2 = 1 


Thus we can interpret 


\{9,<p\Lrn)\ 2 dQ = \Y Lm (9,cP)\ 2 dQ 


(9.139) 


(9.140) 


as the probability of finding a particle in the state |/, m) within the solid angle dQ 
at the angles 9 and 0. 


13 We are assuming that we are interested in states that yield a discrete energy spectrum, as 
opposed to a continuous eigenvalue spectrum, which would require Dirac delta function normal¬ 
ization similar to that which we used for the position and momentum eigenstates. We will consider 
the continuum solutions to the Sehrodinger equation when we come to scattering in Chapter 13. 
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Figure 9.10 (a) The solid angle d£l in three dimensions is 
defined as the surface area dS subtended divided by the radius 
squared: dS/r 2 = (r dO)(r sin $ d<p)/r 2 — dQ. (b) The ordinary 
angle dtp in two dimensions is defined as the arc length ds 
subtended divided by the radius: dsjr = d(j>. 


To obtain the orbital angular momentum eigenfunctions Yf m (9, 4>) themselves, 
we start with the equation 14 

L + \l,l) = 0 (9.141) 

Since L± = L x ± iL y , using (9.127) and (9.128), we can represent the raising and 
lowering operators in position space by the differential operators 

L ± -+ - e ±i4> (±i — - cot 0 —) (9.142) 

i V 3 9 d<t>J 

Thus in position space (9.141) becomes 

(0, <p\L + \l, l) = -J* (i— -cotP—) (0. 4>\l, l)= 0 (9.143) 

i \ 89 3 <p / 

Inserting the known e llrl> dependence, we can solve the differential equation 

<0,0|/,Z)=O (9.144) 

to obtain 

(9, (j>\l,l) = casin'<9 (9.145) 


{h~ Uate ) 


14 This approach is similar to the one we use in Section 7.4 for determination of the position- 
space eigenfunctions of the harmonic oscillator. For an alternative technique in which the spherical 
harmonics are determined by solving a second-order partial differential equation by separation of 
variables, see Problem 9.17. 
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Satisfying the normalization condition (9.139), we find 

(9, 0|/, /) = Y,j(0, 0) = (zljl / (2/ + 1)! g -^ ^ 0 ( 9.i46) 

I'll V 47r 

We now apply the lowering operator to detennine the remaining spherical har¬ 
monics. From Chapter 3 we know that 

L_\l, m) = v//(/ + 1) - m(m - 1) fi\l, m - 1> (9.147) 

Combining (9.146) and (9.147), wc find (see Problem 9.18) form > 0 


jl—m 


y, mie . 0 ) = <=* (2, + w +m y. _ 

2'/! V 4rr(/ —m)! sin"' 0 t/(cos #) ,_ 


sin 27 0 (9.148) 


The choice of the phase factor (—1) ; is taken to ensure that T/ o (0, 0). which is 
independent of 0, has a real positive value for 9 = 0. In fact. 


Y,, 0 (e, 0) = P ,(cos 9) (9.149) 

V 4jr 

where P/(cos 0) is the standard Legendre polynomial. The spherical harmonics for 
m < 0 are given by 


Y L . m (9. 0 )=(-im,„,(0,0)r 

It is useful to list the spherical harmonics with / = 0. 1. and 2: 


(9.150) 


r o .o(0. ~ 

JI 

(9.151) 

y 

ii_ 

--V 

3; 

II 

=Fsin 9 

V 8,t 

(9.152a) 

Y l0 (9, 0) = 

.1 — COS# 

V 4lT 

(9.152b) 

y 2>±2 (0,0) = 

J 15 e ±2i *sin 2 9 

V 32t r 

(9.153a) 

Y z ±i(e, 0) = 

4:,/ — e*"* sin 9 cos 9 

V 8;r 

(9.153b) 

Y 2 ,o( 0 , 0) = 

,/ —(3 cos 2 # - 1) 

V 16jt v 

(9.153c) 

Figure 9.11 shows plots of | Y{ m (9, 0)| 2 as a function of 9 and 0. Since the 
spherical harmonics depend on 0 through these plots are all independent 

of 0. The / = 0 state, often called an s state, is spherically symmetric. Thus if a 
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Figure 9.11 Plots of \Y l m (0. $)| 2 for / = 0. 1. 2. and 3. 


rotator, such as the diatomic molecule discussed in Section 9.6, is in an s state, a 
measurement of the orientation of the rotator is equally likely to find it oriented in 
any direction. The / = 1 states are known as p states. The states with m — ±1 have 
a probability density that tends to reside in the x-y plane, which is just the son of 
behavior that you might expect for an object rotating around the z axis with nonzero 
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angular momentum. This effect becomes more pronounced for the maximum rn 
values with increasing values of /. The / = 1, m = 0 state is often referred to as 
a p z state, or orbital; the probability density is oriented along the z axis. Since 
z — r cos 9 , the function K l0 may be expressed as 


Using x = r sin 9 cos <p and y = r sin 9 sin (p. we see that 




(9.154) 


(9.155) 


Thus we can find linear combinations of the ±1 states, namely. 


ai-i-ru) 

72 



(Oil+ *!-!> 
72 


[Ty 
V 4nr r 


(9.156) 


that can naturally be termed p x and p v orbitals, with probability densities oriented 
along the x and y axes, respectively. 

In Chapter 10 we will solve the hydrogen atom and see how the principal quantum 
number n enters the energy eigenvalue equation. In Chapter 12 we will examine how 
quantum mechanics allows us to understand the valence properties of multielectron 
atoms as well. It is worth getting a little ahead to point out that the directional 
properties of molecular bonds are to a large extent determined by the shape of 
the orbital angular momentum eigenfunctions. For example, oxygen, with eight 
electrons, has two electrons in n = 1 s states, two in n = 2 s states, and four in 
n — 2 p states. As the electrons fill up these p slates, the first three go into the p x , 
p Y , and p z states. This tends to keep the electrons apart, minimizing their Coulomb 
repulsion. The fourth electron is forced to go into one of these p states, say the p 2 
state, leaving the p x and p y states with one electron each. When oxygen binds with 
hydrogen to form H 2 0, each of the hydrogen atoms shares its electron with oxygen, 
helping to fill the oxygen n = 2 p x and n = 2 p y states. Thus the two hydrogen 
atoms in the water molecule should make a right angle with respect to the oxygen. 
Actually, since the hydrogen atoms are sharing their electrons with the oxygen atom, 
each ends up with a net positive charge and these positive charges repel each other, 
pushing the hydrogen atoms somewhat further apart. The observed angle between 
the hydrogen atoms turns out to be 105°. 

Lastly, we reexamine our old friend the ammonia molecule, NH 3 . Nitrogen has 
seven electrons; as with oxygen, three of them arc in p x , p y , and p, states. Thus 
nitrogen has room for three additional electrons, which it acquires from the three 
hydrogen atoms in forming the NH 3 molecule. The three hydrogens should all come 
out at right angles to each other, but here again the repulsion between the hydrogen 
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atoms forces the angles to be somewhat larger than 90°. We now' see why the 
ammonia molecule is not a flat, or planar, structure, a necessary precondition for 
the tunneling action of the nitrogen atom that w r e discussed in Chapter 4. 


EXAMPLE 9.3 A molecule in the p x orbital is placed in a magnetic field 
B = fi 0 k. Show that the orbital precesscs about the 2 axis. 

SOLUTION The angular wave function at t = 0 is given by 


( 0 , <f >\^ (0) > = sin6 cos 0 = —|=(T) - f - K, ,) 

* 

The Hamiltonian is 

H = -ft B = — ( —l) • B = co 0 L, 

\ 2 m e c ) 

where co 0 — eB 0 /2m e c. Therefore 


= ~= - e- iw °'Y u ) 


s/2 

J_ 

s/2 


sin 6e + J— sin 
\ $71 V 8 n 


= . — sin 6 cos(<£ — co$t) 
4 7T 


Thus the period of precession is T — 27 t/co 0 . When t = T/ 4, for example, 
the wave function is 


< 0 , *|*(7/4)) = 

namely, the p v orbital. This rotation of the orbital angular momentum state 
of the molecule is the same precession that we saw in Section 4.3. where we 
examined the precession of the spin state of a spin-^ particle in an external 
magnetic field. 


3 / 3 13 v 

— sin 6 cos (0 — tc/2 ) = . — sin 0 sin <t> = . - 

4rr V 4.7 V 4tt r 


9.10 Summary 


Physicists have learned a lot about nature by analyzing the two-body problem. In 
this chapter we have focused on two bodies that interact through a central potential 
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that depends only on the magnitude |r| = |r, - r 2 | of the distance separating the two 
bodies. In this case there arc a number of symmetries of the system that we can take 
advantage of to determine the eigenstates of the Hamiltonian. First, the Hamiltonian 
is invariant under translations of both of the bodies (rj T| + a. r-, r 7 + a), 
and therefore the total momentum operator P = p, + p 2 , the generator of total 
translations, commutes with the Hamiltonian. Thus the Hamiltonian and the total 
momentum operator have eigenstates in common. 

It is common to work in the center-of-mass frame where P = 0. in which case 
we can restrict our analysis to the relative Hamiltonian 

~ 2 

ff = £" + V(l?l) (9.157) 

2fi 

which is invariant under rotations. The operator 

R(d<t> k) = 1 - l -L z d<j) (9.158) 

n 

that rotates position states about the z axis: 

R(d(j)k)\x, y, z) = \x — y d(p, y + x d<j>, z) (9.159) 


has a generator 


L z = xp y -y p x 


(9.160) 


which is the z component of the orbital angular momentum. Thus the invariance 
of the Hamiltonian under rotations means the relative orbital angular momentum 
operators L^rxp that generate these rotations commute with the Hamiltonian. 
We therefore deduce that H, L 2 , and one of the components, say L z , have eigenstates 
| E, /, m) in common. In position space, the energy eigenvalue equation is given by 


(r\H\E, /, m) 



/a 2 2d\ 1(1 +I)n 2 

\3r 2 r dr) 2pr 2 


+ V(r) 


(r\E,l,m) 


— E (r|£i, /, m) 


(9.161) 


where /(/ + l)h 2 is the eigenvalue of L 2 . 

The operator L : is represented in position space by 

(9A62) 

i dtp 

and the <p dependence of ( 9 , <p\l. m) — Y t ,„(9, <p), the orbital angular momentum 
eigenfunctions, is given by e‘ m In order that the action of differential operators 
such as (9.162) be well defined, the position-space wave functions must be single¬ 
valued, and therefore the m values must be integral. Since m runs from —l to l in 
integral steps, I must be integral as well. Thus, we see here clearly that the intrinsic 
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spin of a spin-4 panicle, for example, does not arise from the body literally spinning 
about an axis, for if it did, the spin angular momentum would necessarily be of the 
r x p type, and the angular momentum operators that generate rotations of this body 
could be represented by differential operators in position space, which cannot lead 
to half-integral values for /. 

In our analysis of the two-body problem, we have started with a Hamiltonian that 
exhibits translational and rotational symmetries and used these symmetries to deter¬ 
mine its eigenstates. Since \H . F| = 0 and [H. L] = 0. these symmetries also tell us 
that the total momentum and the relative orbital angular momentum of the system 
are conserved. The importance of this connection between symmetries and conserva¬ 
tion laws becomes really apparent if you continue your study of quantum mechanics 
through quantum field theory. There you will see, for example, how conservation 
of charge and conservation of color can actually be used to determine, through 
symmetry principles, the form of the laws governing electromagnetic (quantum elec¬ 
trodynamics) and strong (quantum chromodynamics) interactions, respectively. 

Problems 


9.1. Follow the suggestion after (9.11) to show that [7" (a v i), f(«Ji] = 0 implies 
f/V Py] = 0. 

9.2. What is the generalization to three dimensions of the Fourier transform rela¬ 
tionship (6.57)? 

9.3. Explain the nature of the symmetry that is responsible for conservation of 
energy. 

9.4. Use the commutation relations of rj, r 2 , Pi, and p 2 to establish that the center- 
of-mass and relative position and mornenmm operators satisfy 

lie,-, Pjl = 0 [X h Pj] = ihSij 

[Xi, p j ] = ihS ij [X h pj] = 0 

9.5. Show explicitly that 

iL + il = li+pi 

2/M] 2 m 2 2 M 2p 


where 


m,p I -m 1 p 2 t> ~ * ha , , m x m 2 

p__-£j-P = Pi+P 2 M — m\+m 2 and p = 


m [ + m 2 


m i -I- m 2 
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9.6. Use the fact that all vectors must transform the same way under rotations to 
establish that any vector operator V must satisfy the commutation relations 

[L z , V x ] = ihVy [L z , V y ] = -ihV x [L z , V z ] = 0 

9.7. Use the identity 

3 

i=l 


together with the commutation relations (9.19) of the position and momentum 
operators and the expression (9.82) for the orbital angular momentum operators to 
verify that 

L 2 = rxprxp = r 2 p 2 — (r • p ) 2 + iflr ■ p 


9.8. Use the commutation relations [x h pj ] = ihSjj to verify that the angular mo¬ 
mentum operators L — r x p, or, in component form, 

3 3 

£«■ = £ Vjh 

j =1 k=l 

satisfy the commutation relations 


3 

Lj] = ih ^ ^ U jk^k 
k =1 

9.9. The carbon monoxide molecule. CO, absorbs a photon with a frequency of 
1.15 x 10 n Hz, making a purely rotational transition from an / = 0 to an l — 1 energy 
level. What is the intemuclear distance for this molecule? 


9.10. The energy spacing between the vibrational energy levels of HC1 is 0.37 eV. 

(a) What is the wavelength of a photon emitted in a vibrational transition? 

(b) What is the effective spring constant k for this molecule? 

(c) What resolution is required for a spectrometer to resolve the presence of 
H 3:, C1 and H 37 C1 molecules in the vibrational spectrum? 

9.11. The ratio of the number of molecules in the rotational level /, with energy E h 
to the number in the / = 0 ground state, with energy E 0 , in a sample of molecules in 
equilibrium at temperature T is given by 

(21 + l)e -(£j - £ o)/*BT 


where the factor of 21 + 1 reflects the number of rotational states with energy E h 
that is, the degeneracy of this energy level. 
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(a) Show that the population of rotational energy levels first increases and then 
decreases with increasing /. 

(b) Which energy level will he occupied by the largest number of molecules for 
HC1 at room temperature? Compare your result with the intensities of the 
absorption spectrum in Fig. 9.9. What do you deduce about the temperature 
of the gas? 

9.12. The wave function for a particle is of the form (Hr) = (x + y + z)f(r). What 
are the values that a measurement of L 2 can yield? What values can be obtained 
by measuring L{1 What are the probabilities of obtaining these results? Suggestion: 
Express the wave function in spherical coordinates and then in terms of the Y t m ’s. 

9.13. A particle is in the orbital angular momentum state |/, m). Evaluate A L x and 
A L y for this state. Which states satisfy the equality in the uncertainty relation 

AL x AL y > ~\(^z)\ 

Suggestion: One approach is to use L x = (L + + L_)/2, and so on. Another is to take 
advantage of the symmetry of the expectation values of L 2 and L 2 in an eigenstate 
of L z . 

9.14. Use the position-space representations of the orbital angular momentum op¬ 
erators L x , L v , and L z given in (9.117), (9.127), and (9.128). respectively, to derive 
the position-space representation of the operator L 2 given in (9.129). 

9.15. Show that the spherical harmonics Y l m are eigenfunctions of the parity oper¬ 
ator with eigenvalue (— l/. Note: An inversion of coordinates in spherical coordi¬ 
nates is accomplished by r —► r, 9 -> n — 0, and </> -*■ <f> + .t. Use (9.148). It may 
be wise to check the specific examples in (9.151) through (9.153 1 before attempting 
the general case. 

9.16. 

(a) Obtain Y x 0 by application of the lowering operator in (9.142 1 to Y . 

(b) By direct application of the operator L 2 in position space [see (9.129)1 verify 
that Fj i is an eigenfunction with eigenvalue 2 fr. 


9.17. Determine the spherical harmonics and the eigenvalues of L 2 by solving the 
eigenvalue equation L 2 |A, m) = A/i 2 |A, m) in position space. 


-h 2 


_J_9_ 

sin 0 iW 



_L _ c^_~ 

sin 2 6 34 > 2 . 


= kh 2 Q X 'm(0)e im * 
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Note that we have inserted the known <p dependence. To illustrate the procedure, 
restrict your attention to the m = 0 case. Rewrite the equation in terms of u — cos d 
and show that it becomes 


(1 t* )■ 


d 2 e 


x,o 


-2u- 


dG 


A,0 


+ A©x.o — 0 


du 2 du 

which you may recognize as Legendre’s equation. Try a power series solution 


DC 

0A,O — X/ 

k =0 

Show that the series diverges for |u | —*- 1 (6 -*■ Oor# —*• n) unless a = /(/ + 1) with 
1 = 0, 1, 2, ..., in which case the series terminates. The solutions 0, 0 (n) — P/(u) 
are just the Legendre polynomials. Compare the first few solutions with the spherical 
harmonics listed in equations (9.151), (9.152b), and (9.153c). 


9.18. Apply the lowering operator to Y u as given in (9.146) to determine !//_,. 
Check your result against the general expression (9.148) for the I) s. 

9.19. In a diatomic molecule the atoms can rotate about each other. This rotation can 
be shown to be equivalent to a reduced mass fi = m \m 2 /(m ( 4- rn 2 ) rotating in three 
dimensions about a fixed point. According to classical physics, the energy of a three- 
dimensional rigid rotator is given by E = L 2 /2/, where / is the moment of inertia 
and L 2 = Z. 2 + L 2 + L 2 is the magnitude squared of the orbital angular momentum. 

(a) What is the energy operator for this three-dimensional rotator? What are the 
energy eigenstates and corresponding energy eigenvalues? Take the moment 
of inertia I to be a constant. (A complication is that molecules “stretch" as 
they rotate faster, so the moment of inertia is not a constant.) Show in the limit 
of large / (/ » 1) that 


Ei 

which means that the discrete nature of the energy levels becomes less appar¬ 
ent as l increases. 

(b) Determine the frequency of the photon that would be emitted if the rotator 
makes a transition from one energy level labeled by the angular momentum 
quantum number / to one labeled by the quantum number 1 — 1. 

(c) Show in the limit in which the orbital angular momentum quantum number / 
is large (/ » 1) that the frequency of the photon from part (b) coincides with 
the classical frequency of rotation of the rotator (recall L = / a>). This result 
illustrates the correspondence principle, namely how the results of classical 
physics and the results of quantum mechanics can coincide in the appropriate 
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limit. It also shows how wc might use the correspondence principle to deduce 
the existence of a selection rule, in this case Al = ±1. Note: Here A/ means 
the change in /, not the uncertainty in I. 


9.20. The wave function of a rigid rotator with a Hamiltonian H — L 2 /2 / is given by 

(0, 0|0(O)) = sin 6 sin 0 

V 4jt 

(a) What is (0, 0|0(r))? Suggestion: Express the wave function in terms of the 

(b) What values of L. will be obtained if a measurement is carried out and with 

what probability will these values occur? , 

(c) What is (L x ) for this state? Suggestion: Use bra-ket notation and express the 
operator L v in terms of raising and lowering operators. 

(d) If a measurement of L x is carried out. what result(s) will be obtained? With 
what probability? Suggestion: If you have worked out Problem 3.15, you can 
take good advantage of the expressions for the states 11. m) x . 

9.21. Suppose that the rigid rotator of Problem 9.20 is immersed in a uniform 
magnetic field B = B 0 k. and that the Hamiltonian is given by 

- L 2 

H =-P u)r]L 7 

21 

where a> 0 is a constant. If 

(6. 0|0(O)) = J — sin 0 sin 0 

V 4 7t 

what is ( 9 , 010(r))? What is { L x ) at time r? 


9.22. Treat the ammonia molecule, NH 3 , shown in Fig. 9.12 as a symmetric rigid 
rotator. Call the moment of inertia about the z axis / 3 and the moments about the 
pair of axes perpendicular to the z axis 7j. 

(a) Express the Hamiltonian of this rigid rotator in terms of L, l\, and / 3 . 

(b) Show that \H, L z ] = 0. 

(c) What are the eigenstates and eigenvalues of the Hamiltonian? 

(d) Suppose that at time t = 0 the molecule is in the state 


|0) = "7=|0, 0) + -J=|l. 1> 

sfi V2 


What is 10(/)) ? 
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z 


i 



Figure 9.12 The ammonia molecule. 


9.23. The Hamiltonian for a three-dimensional system with cylindrical symmetry 
is given by 

rv -) 

H = ^ + V(p) 

2/r 

where p = y/x 2 + y 2 . 

(a) Use symmetry arguments to establish that both p,, the generator of transla¬ 
tions in the z direction, and L z , the generator of rotations about the z axis, 
commute with H. 

(b) Use the fact that H, p : , and L. have eigenstates in common to express the 
position-space eigenfunctions of the Hamiltonian in terms of those of p z 
and L,. Suggestion: Follow a strategy similar to the one that we followed 
in (9.94) for a spherically symmetric potential except that here we arc using 
the eigenfunctions of p, and L : instead of the eigenfunctions L : and L z . 

(c) What is the radial equation? Note: The Laplacian in cylindrical coordinates 
is given by 


P ap 


di{/\ 1 d 2 \If d 2 \j/ 

dp ) p 2 dtp 2 dz 2 
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Bound States of Central Potentials 




In this chapter we solve the Schrodinger equation for the bound states of three 
systems—the Coulomb potential, the spherical well, and the three-dimensional 
harmonic oscillator—for which the potential energy V = V (r). Figuratively, the 
Coulomb potential forms the centerpiece of our discussion. The exact solution of the 
two-body problem with a pure Coulomb interaction serves as the starting point for a 
detailed comparison in both this chapter and the next between theory' and experiment 
for the hydrogen atom, a comparison that gives us much of our confidence in quantum 
mechanics. 

10.1 The Behavior of the Radial Wave Function Near the Origin 


In Chapter 9 we saw that for a potential energy with spherical symmetry', we can 

X a X 

write the energy eigenfunctions as simultaneous eigenfunctions of L- and L.: 


<r| E,l, m) = R Eil (r)Y[ tin (9, <f>) 


( 10 . 1 ) 


The Schrodinger equation for the radial wave function is given in (9.96), 



/ cP_ + 2 d_\ 
V^r 2 r dr J 


l (l + 1 )h 2 
+ 2 nr 2 


+ V(r) 


R £,/(>■) — ERejO") 


( 10 . 2 ) 


where we have inserted subscripts indicating explicitly that the radial wave function 
depends on the values for E and 1. The lack of dependence of (10.2) on mh means 
that each energy state with fixed E and fixed / will have at least a 2/ + 1 degeneracy. 
As we did in Chapter 9, it is convenient to make the substitution 


Re, i O') 


»£,/(?•) 

r 


(10.3) 
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in which case (10.2) becomes 

[-tS + +w] ( |0 ' 4 » 

Expressed in terms of u £ /(r), the normalization condition (9.138) for bound states 
is given by 1 

f r 2 dr R* ei R e i = f dru* El u EJ = 1 (10.5) 

J o ' Jo 

Thus, as we remarked earlier. (10.4) has the same form as the one-dimensional 
Schrodinger equation (9.98) except that the variable r runs from 0 to oc, not from 
—oo to oo. This naturally raises the question of what happens to the wave function 
when r reaches the origin. 

Provided the potential energy V(r) is not more singular at the origin than l/r 2 , 
the differential equation (10.4) has what is known as a regular singularity at r = 0, 
and we are guaranteed that solutions in the form of a power series about the origin 
exist. To determine the leading behavior of u E j(r) for small r. we substitute r s into 
(10.4): 

-—S(S - 1 )r s ~ 2 + /(/ + 1)fi V ~ 2 + V (r)r s = Er ' (10.6) 

2n 2/x 

Notice that the r v-2 terms dominate for small r if V (r) is less singular than l/r 2 , 
that is, 

r 2 V(r) ->0 (10.7) 

r—o 

Satisfying (10.6) for small r requires the coefficients of r s-2 to obey 

-sOi- l)+/(/+ 0 = 0 (10.8) 

which shows that s = / -f 1 or s = — /. However, we must discard those solutions that 
behave as r~ l for small r. For 1 > 1, these solutions cannot satisfy the normalization 
condition (10.5) because the integral diverges at the lower limit. For / = 0, the leading 
behavior of u for small r is a constant and the integral (10.5) is finite. But if the 
leading behavior for u is a constant, the wave function R behaves as l/r near the 
origin, which is also unacceptable. To see this, return to the full three-dimensional 
Schrodinger equation in position space (9.130) and note that 2 


1 We restrict our analysis in this chapter to bound states, which have a discrete energy .spectrum 
and can therefore obey this normalization condition. In Chapter 13 we will consider the continuum 
portion of the energy spectrum for central potentials. 

2 One way to verify this result is to integrate both sides of the equation over a spherical volume 
including the origin and use Gauss’s theorem: 
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V 2 - = —47r5 3 (r) (10.9) 

r 

Therefore, a \/r behavior for R cannot satisfy the Schrodinger equation, since we 
are presuming that the potential energy does not have a delta function singularity at 
the origin. Thus we must discard the r~ l solutions for all /; consequently, we deduce 
that the allowed behavior for small r is given by 

u El - yr l+l (10.10) 

’ r—►O 

and hence u E t (r) must satisfy 3 


u El ( 0) = 0 (10.11) 

Notice that as / increases, the particle is less and less likely to be found in the 
vicinity of the origin. Recall that the “one-dimensional” Schrodinger equation (10.4) 
has an effective potential energy 


Vc ff (r) 


/(/ + 1 )h 2 

2/xr 2 


+ V(r) 


( 10 . 12 ) 


The /(/ + 1) /2/ir 2 term, known as the centrifugal barrier, increases with increasing 
/ and tends to keep the particle away from the origin, producing the small-/- behavior 
of the wave functions (see Fig. 10.1). 

The behavior (10.10) implies that the radial wave functions 


R ei - >r' (10.13) 

’ r~* 0 

and thus these wave functions vanish at the origin for all except 1 = 0. or s, states. A 
dramatic illustration is the annihilation of positronium, a hydrogen-like atom where 
the nucleus is a positron, the antiparticlc of the electron, instead of a proton. For 
the electron and the positron to annihilate, they must overlap spatially, which can 
occur only in s states. Positronium is often formed by the capture of a slow positron 
by an electron, generally in a highly excited state of the “atom.” The atom then 
undergoes a sequence of radiative transitions, most often ending in the ground state, 
which we will see in the next section is an / — 0 state, where annihilation of the 
particle-antiparticle pair finally takes place. 


J d 3 r V • V- = ^ dSn ■ V- = 47rr 2 = = f 4 3 r I— 4nS : 


(r)l 


.Another way is to note the solution of Poisson’s equation in electrostatics, V 2 (p = —4 TTp, for a 
point charge q : 

V 2 - = —47r<y<5 3 (r) 
r 

3 See also Problem 10.1. 
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10.2 The Coulomb Potential and the Hydrogen Atom 


The Hamiltonian for a hydrogenic atom is 



Ze 2 


(10.14) 


Of course, the usual hydrogen atom has Z = 1, but by introducing the factor of Z, 
we can also consider atoms with a charge Ze on the nucleus that have been ionized 
so that only a single electron remains. Examples include He + , Li ++ , and so on. The 
radial equation for the function u E i is given by 


l (l + 1 )h 2 
2fx dr 2 2nr 2 



u E j(r) = Eu E j(r) 


(10.15) 


Note that the potential energy of the system is negative because, as is customary, 
we have chosen the zero of potential energy when the two particles in the system 
are very far apart, that is, when r —oo. The potential energy V (r) is then the work 
that you perform to bring the particles from infinity to a distance r apart with no 
kinetic energy. Since the particles attract each other, you do negative work and 
therefore the potential energy is negative, as shown in Fig. 10.1. Classically, the 
two particles are bound together whenever the energy E of the system is negative, 
because in this case there exists a radius beyond which the potential energy exceeds 
the total energy E, which would require negative kinetic energy (see Fig. 10.2). 
In this chapter we will restrict our attention to determining the bound states; in 
Chapter 13 we will examine the significance of the positive-energy solutions when 
we discuss scattering. 




Figure 10.1 (a) The Coulomb potential V(r) = —e 2 /r and the centrifugal 
barrier / (/ + \)h 2 jl^r 2 add together to produce the effective potential energy 
Vat — 1(1 + l)/r/2/.rr 2 + V(r). (b) The effective potential for several values 
of/. 
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V 



Figure 10.2 For a particular value of the total 
energy E, there is a maximum radius beyond which 
the particles cannot separate classically. 


Since we are seeking solutions with negative energy to the differential equation 
(10.15), it is convenient to write E — — |is| and to introduce the dimensionless 
variable 



Expressed in terms of this variable, (10.15) becomes 4 


d 2 u /(/ + !) / A. 

dp 2 P 2 “ + Vp 



where 

. _ Ze 2 nr 
= TV W\ 


( 10 . 16 ) 


(10.17) 


( 10 . 18 ) 


If we try to solve (10.17) through a power-series solution, we get a three-term 
recursion relation. However, in the limit p oo, the equation simplifies to 


which has solutions 


d 2 u 

~dp 2 



u = Ae p/2 + Be p/1 


(10.19) 


( 10 . 20 ) 


We discard the exponentially increasing solution because such a solution cannot 
satisfy the normalization condition (10.5). If we factor out the small-p behavior 


4 Wc suppress the subscripts in this and succeeding equations for notational convenience. The 
factor of 8 in (10.16) is chosen to make (10.17) work out nicely. 
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(10.10) as well as the large-/) behavior, we can attempt to find a solution to (10.17) 
of the form 


u(p) = p l+l e- p/2 F(p) 


( 10 . 21 ) 


in a manner similar to that which we used to solve the harmonic oscillator in 
Section 7.9. 5 

With this substitution, the differential equation (10.17) becomes 



( 10 . 22 ) 


Although this differential equation may seem more complicated than our starting 
point (10.17), it is now straightforward to obtain a power-series solution of the form 


F(p) = W k 

k =0 


(10.23) 


with the restriction that c 0 ^ 0 so as not to violate (10.10). It should again be 
emphasized that we have made no approximations in deriving (10.22). It may appear 
that we have discarded one of the two possible behaviors of the function u for large p, 
but, as we will now see, the exponentially increasing solution will resurrect itself 
unless we make a judicious choice of A. 

Substituting (10.23) into (10.22), we obtain 


OC OO OC 

) ^ k(k — l)CjtP^ ' + ^ (2/ + 2)kc k p k " + y [—A + A — (/ + OJc^p* 1 = 0 


1=2 


*=i 


*=o 


(10.24) 


Making the change of indices k — 1 = A' in the first two summations and then 
renaming k' = k, we can express (10.24) as 

■30 

Y, {[*(* + 1) + (2/ + 2 )(k + l)]c* +1 + \-k + A — (/ + l)lc*} p k ~ ] = 0 (10.25) 

k—0 

leading to 


c k _ t.| k l 1 — A 

c k 4" 1)(A + 21 + 2) 


(10.26) 


5 Factoring out the small-p behavior is really just using the method of Frobenius to solve 
the differential equation by a power series. Before going on, it might be useful to read through 
Section 7.9 again. In particular, notice the discussion from (7.105) through (7.108). Similar 
arguments arc used in this section to generate (10.26). 
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Note that 


c *+t _^ 

Q. *-► 50 k 


(10.27) 


which is the same asymptotic behavior as e p . Thus, unless the series (10.23) termi¬ 
nates, the function u(p) in (10.21) will grow exponentially like e p/2 . To avoid this 
fate, we must have 


X — l + /rW r 


(10.28) 


where 


n r = 0,1,2,... (10.29) 

determines the value of k at which the series terminates. The function F will thus 
be a polynomial of degree known as an associated Laguerre polynomial. 
Quantizing X in (10.28) leads to a quantized energy from (10.18): 


E 


pZ 2 e 4 

2fi 2 (1 + l + n r ) 2 


(10.30) 


Since / and n r are both integers that are greater than or equal to zero, we define the 
principal quantum number n by 


/ + 1 4- n r = n 


(10.31) 


Thus in terms of the principal quantum number 

E n = -^4 » = 1,2,3,... (10.32) 

2 h-n- 

A useful way to express the result (10.32) is to introduce the speed of light c to 
form 



(10.33) 


a dimensionless quantity whose value is approximately 1/137 and is known as the 
fine-structure constant, for reasons that will become apparent in Chapter 1 i , 6 In 
terms of a, the allowed energies are given by 


E 


n 


lxc 2 Z 2 a 2 
2 n 2 


(10.34) 


Equation (10.34) is easy to remember. The quantity' pc 2 carries the units of energy, 
and for hydrogen equals 0.511 MeV. The reason that the atomic energy scale is eV 


6 In SI units, a = <? 2 /4 ns 0 Fic. 
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Figure 10.3 The energy levels of the hydrogen atom 
superposed on a graph of the Coulomb potential 
energy. 


rather than MeV is the small value of a ( a 2 = 5.33 x 10 5 ). Numerically, the energy 
levels for hydrogen (Z = 1) are given by 7 


E 


n 


13.6 eV 


(10.35) 


and are indicated in Fig. 10.3. 

When a hydrogen atom makes a transition from a state with principal quantum 
number n,- to one with n f (n f < n,), the atom emits a photon with energy 


' To reaeh a deep understanding of why the energy scale of atomic physics has the value it docs, 
we need to understand both why /n^c 2 = 0.511 MeV and why or has the value 1/137.0360. Although 
we do not know why elementary particles such as the electron have the masses that they do, the 
actual numerical value for the mass-energy of the electron cannot have any deep significance, since 
it depends on our choice of units, including, for example, the magnitude of the standard kilogram 
that is kept in Paris. The numerical value of ct. on the other hand, is completely independent of 
the choice of units. It is. therefore, a fair and important question to ask why or has this particular 
v alue. If you can provide the answer, you can skip past Chapter 14 and on to Stockholm to collect 
your Nobel prize. 
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Figure 10.4 The visible spectrum of hydrogen, showing the Balmer series. Adapted from 
W. Finkeinburg, Structure of Matter, Springer-Verlag, Heidelberg, 1964. 


hit — E„ 


' n f 


/ic 2 a 2 / 1 

~ w 



and therefore with inverse wavelength 


1 _ fica 2 
X ~ 2 h 



(10.36) 


(10.37) 


The value of the Rydberg for hydrogen, as determined by (10.37). is in complete 
agreement with experiment. Figure 10.4 shows the spectrum of hydrogen produced 
when transitions take place from a state with principal quantum number > 2 
directly to a state with rtf = 2. These transitions, which are in the visible part of 
the spectrum, form the Balmer series. Transitions directly' from excited states to the 
ground state (n /■ — 1) emit more energetic photons in the ultraviolet portion of the 
spectrum, known as the Lyman series, while transitions from excited states to states 
with tif = 3 emit less energetic photons in the infrared portion of the spectrum, 
known as the Paschen series. 

Note from (10.34) that E n m e c 2 , which is consistent with our use of the 
nonrelativistic Schrodinger equation to describe the hydrogen atom. Of course, 
relativistic effects do exist. In Chapter 11 we will see that these effects produce a 
fine structure within the energy levels. There is also a different type of fine structure, 
whose origin is already apparent in (10.34), that is discernible in a typical hydrogen 
spectrum. This structure is due to the existence of an isotope of hydrogen in which 
the nucleus consists of a deuteron, a bound state of a proton and a neutron, instead 
of a single proton. Expressing the reduced mass fj. in terms of the mass M N of the 
nucleus, we see that 


m e c 2 Z 2 tx 2 f 1 \ ^ m e c 2 Z 2 a 2 / _ m e \ 

2 n 2 V1 + m e /M N ) ~ 2n 2 \ M N ) 


(10.38) 


where the last step follows since m e /m N <<i 1. Comparing this expression for hy¬ 
drogen, where M N = m p , with that for deuterium, where the mass of the nucleus is 
roughly twice as large as for hydrogen, we see that the spectral lines of deuterium 
are shifted to slightly shorter wavelengths in comparison with those of hydrogen. 
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For example, the H„ line at 6562.8 A in the Balmer series, corresponding to the 
transition from n ; = 3 to n f - = 2, is shifted by about 1.8 A, while the line at 
4861.3 A, corresponding to the transition from n,- =4to iij =2, is shifted by 1.3 A. 
In naturally occurring hydrogen, this effect can be difficult to see because the nat¬ 
ural abundance of deuterium is roughly 1 part in 7000. However, by increasing the 
concentration of the heavy isotope using thermal diffusion techniques, H. C. Urey 
discovered deuterium spectroscopically in 1932, the same year that the neutron was 
discovered. 8 


EXAMPLE 10.1 What is the ionization energy of positronium in its ground 
state? 

*? 

SOLUTION According to (10.34) with Z set equal to one. 


E„ = — 


i 2 

/zc or 


-n — - 7 

2/i- 


Since the mass of the positron is equal to the mass of the electron, the reduced 
mass for positronium is 


M = 


m x m 2 m e 


m l + m 2 2 
Therefore, the ground energy of positronium is 

13.6 eV 


E , = 


= —6.8 eV 


so it takes 6.8 eV to ionize positronium. 


THE HYDROGENIC WAVE FUNCTIONS 

Since we can specify the energy eigenvalue by specifying the principal quantum 
number n, we can label the energy eigenfunctions by the quantum numbers n, l, 
and m: 


(r|n, /. m) = R nJ {r)Y Lm {0 , 0) = Y t m {6, 0) 

r 

Note that the dimensionless variable p in (10.16) is given by 


/8^|£| 2 Zpca 2Z r 

P = \l —£T- r = Z - r =- 

n- fin n 


(10.39) 


(10.40) 


" H C Urey. F. G. Brickweddc, and G. M. Murphy. Pins. Wei. 40. 1 (1932). Urey received the 
Nobel prize in 1934 for this discovery. 
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where the length 

a 0 =— (10.41) 

Uca 

known as the Bohr radius, is a convenient length scale to use in expressing the wave 
functions. For hydrogen, a 0 has the magnitude 0.529 A. 

The ground state has n = 1 and, consequently from (10.31). 1 = 0 and n r — 0. 
The power series (10.23) terminates after the first term. F is just the constant c 0 , and 
from (10.21) 


«i.o (p) = c 0 pe 


-P/2 


(10.42) 


The normalized radial wave function is then given by 

/ 7 \ -V2 

/?, 0 = 2 y—J e~ Zr/a ° (10.43) 

We should emphasize that the ground state has zero orbital angular momentum. This 
is to be contrasted with the early Bohr model, which preceded the development of 
quantum mechanics, in which the electron was believed to follow a definite orbit in 
each allowed stationary state. Each stale in the Bohr model had a nonzero value of the 
orbital angular momentum, in an attempt to account for the stability of the atom. The 
only way to describe a bound state of zero orbital angular momentum with classical 
trajectories would be to have the electron traveling through the proton in the hydrogen 
atom in straight lines. Of course, as we saw in Section 9.4, quantum mechanics 
accounts naturally for the stability of the ground state through the uncertainty- 
principle. and as we saw in Chapter 8, the concept of a classical trajectory is 
inappropriate for describing the motion of an electron within the atom. In particular, 
see Problem 8.5. 

Let’s examine the higher energy states. The first excited states have n = 2. Here 
we can have l = 0, n r = 1, which means the power series (10.23) has two terms and 
is therefore a first-degree polynomial; or we can have l = 1, n r = 0. which means 
the series (10.23) has only the first term, but in contrast with (10.40), we pick up an 
extra factor of r from the r ,+ l in (10.21). The normalized radial wave functions for 
these states are given by 


Rits ~ 2 {id ( l- f;) 


-Zr/2a 0 


y/3 v 2Uf) / fly 


R 


Zr/2a n 


(10.44a) 


(10.44b) 


The second excited states have n = 3. There are three possibilities: / = 0, n,. = 2, a 
second-degree polynomial for F:l = 1, n r = 1, a first-degree polynomial for F: and 
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— 2, n r — 0. The corresponding normalized radial wave functions are given by 



(10.45a) 


(10.45b) 

(10.45c) 


These are, of course, just the radial wave functions. The complete energy eigen¬ 
functions also involve the spherical harmonics, as indicated in (10.39). As we saw- 
in Chapter 9, these spherical harmonics can give rise to rather involved probabil¬ 
ity distributions as functions of the angles 0 and <p. However, if we ask only for 
the probability of finding the particle between r and r + dr, we must integrate the 
three-dimensional probability density |(r|n, /, m )| 2 over all angles. Since the Y t ,,,’s 
are themselves normalized according to (9.139), we are left with 



sin 9 d6 d<p r 2 dr |<r|£, /, m)\ 2 = r 2 \R n ,{r)\ 2 dr 


(10.46) 


as the probability of finding the particle between r and r + dr. Note that the factor of 
r 2 in the radial probability' density r 2 \R nl (r)\ 2 comes from the volume element c/ V. 
The radial wave functions, as well as the radial probability density, are plotted in 
Fig. 10.5 for the wave functions (10.43), (10.44), and (10.45). 

As we have seen, F ( p ) is a polynomial of degree n r = n — l — 1. Thus it has 
n r radial nodes. The probability density r 2 \R n /(r)| 2 has n — / “bumps.” When, for 
a particular value of n, l has its maximum value of n — 1, there is only one bump. 
Since n r — 0 in this case, the wave function 


R 


n,n—\ 


a r n ~ l e~ Zr / na ° 


(10.47) 


and thus the probability density 

r 2 | | 2 a r 2n e- 2Zr/na ° (10.48) 


The location of the peak in the probability distribution can be found from 

— r 2 \R n „_ 1 | 2 a (in - —r) r 2 »-i e - 2 Zr/™ 0 = 0 (10 . 49) 

dr ’ V «o« / 

which yields 

n ' a ° (10.50) 




As Fig. 10.5 shows, the states with different values of 1 for a given energy do have 
differing radial probability densities. Even though the states with smaller values of / 
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Figure 10.5 Plots of the radial wave function R n j(r) and the radial probability density 
r 2 \R n j(r)\ 2 for the wave functions in (10.43), (10.44), and (10.45). 
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for a particular n have additional bumps in their probability distributions, the average 
position for each of the states with a given n tends to reside in shells of increasing 
radius as n increases. These extra bumps, which occur within the radius (10.50) 
for each shell, play a big role in determining the order that these states fill up in 
a multielectron atom and consequently in determining the structure of the periodic 
table. We will return to this issue in Chapter 12. 


EXAMPLE 10.2 An electron in the Coulomb field of the proton is in the 
state 

i*)=4=u. °> °>+-4i2.1. 1> 

\/2 V2 

"9 

where |«, /, in) are the energy eigenstates of the hydrogen atom. 

(a) What is |^(/))‘? 

(b) What is (£) for this state? What are (L 2 ), ( L x ), (L v ), and (L,)? 

SOLUTION 

p iE\t/fi p i Eil / h 

mn) = —7=—ifi 0. o> + ——i2,1, i) 


V2 


V2 


where the energy eigenvalues E n are given in (10.34). 

fa) 1 i2 1 i2 


(E) = 


?—< E\l/ti 


V2 


(L 2 ) = 


e -i Eit/h 


g —i Eit/h 


s/2 


£2 = 


E\ + E 2 


s/2 


{L z ) = 


(1)(1 + 1 )h 2 = ti 2 

-iEil/li 2 fc 

—=— h=- 

i/l 2 


(L x ) = m)\L x m)) 

= X -('l'(t)\(L + + L_)W(t)) 


1 JE2I/H 

' (1,0,0| + - 


-£( 


V2 


sfl 

(Ly) = (Mt)\Ly\lHt)) 

= l(^(/)|(L + -LJ|^(r)> 
2 1 


'(2, 1, 1|) 


he~ iE i" h |2, 1.0> = 0 


1 / e iE \‘/ n JF.ii/n \ 

= 0 , 01 + — 7 =^( 2 . 1 , 1 | he~ IEl /R \2, 1 , 0 ) = 0 

2t \ V 2 y/2 ) 


pi Ell/n 


Page 375 (metric system) 



10.2 The Coulomb Potential and the Hydrogen Atom | 359 


Note: [L x ) and (L v ) vanish since L x and L y (and hence L + and 
Z_) commute with L 2 and therefore cannot change the / values. And 
of course amplitudes such as (1, 0, 0|2, 1. 0) vanish since angular 
momentum states with different /’s are orthogonal. 

DEGENERACY 

One of the most striking features of the hydrogen atom is the surprising degree of 
degeneracy, that is. the number of linearly independent states with the same energy. 
For each n, the allowed / values are 

/ = 0, 1.n — 1 (10.51) 

and for each /, there are 21 + 1 states specified by the m values. Thus the total 
degeneracy for a particular n is 

^(2/ + 1) = 2- ~ — + n = n 2 (10.52) 

1=0 

The lack of dependence of the energy on / is shown in Fig. 10.6. This degeneracy 
is unexpected, unlike the independence of the energy on the m value, which we 
expected on the grounds of rotational symmetry. This rotational symmetry would 
disappear if, for example, we were to apply an external magnetic field that picks out 
a particular direction, such as the z direction. In that case, the energy would depend on 
the projection of the angular momentum on the z axis. Unlike the rotational symmetry 


H = 4 
n = 3 

n = 2 


n = 


I 


1 = 0 1=1 


1=2 1= 3 


Figure 10.6 The n = 1 through n = 4 energy levels of the hydrogen 
atom, showing the degeneracy. States with / = 0 are called .v states. 
/ = 1 p states, l = 2 A states, I = 3 / states, and from then on 
the labeling is alphabetical. Historically, this nomenclature for the 
low values of / arose from characteristics in the spectrum: sharp, 
principal, diffuse, and /undamental. 
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that is responsible for the degeneracy of the different m states, there is no “obvious” 
symmetry that indicates that states, such as (10.44a) and (10.44b), with different /’s 
should have exactly the same energy. In fact, if we examine the different effective 
potentials shown in Fig. 10.1 for these states, we see how unusual it is, for example, 
that the state with one node and no centrifugal barrier should have exactly the same 
energy as the state with no nodes and an ti 2 /ixr 2 centrifugal barrier. Historically, 
because the reason for the degeneracy of the / values wasn’t obvious, it was often 
termed an accidental degeneracy. 9 

Our discussion of degeneracy has ignored the spin of the electron and the spin 
of the proton, both spin-j particles. Since there arc two electron spin states and two 
proton spin states for each | n, /, m), we should multiply (10.52) by 4, yielding a 
total degeneracy of 4 n 2 . Thus the ground state, for which n = 1, is really four-fold 
degenerate. It is this degeneracy that is partially split by the hyperfine interaction, 
which we discussed in Section 5.2. 


10.3 The Finite Spherical Well and the Deuteron 


Let’s shift our attention from atomic physics to nuclear physics. For the hydrogen 
atom, the excellent agreement between the energy levels (actually the energy differ¬ 
ences) and the spectrum of the photons emitted as the atom shifts from one energy 
level to another provides a detailed confirmation that the potential energy between an 
electron and a proton is indeed — e 2 /r , even on the distance scale of angstroms. This 
is our first serious indication that Maxwell’s equations describe physics on the mi¬ 
croscopic as well as the macroscopic scale. When we examine the simplest two-body 
problem in nuclear physics, the neutron-proton bound state known as the deuteron, 
we find things are not so straightforward. The nuclear force between the proton and 
neutron is a short-range force: essentially, the proton and neutron interact strongly 
only if the particles touch each other. Thus we don’t have macroscopic equations, like 
those in electromagnetism, that we can apply on the microscopic scale for nuclear 
physics. Instead we try to deduce the nuclear-force law by guessing or modeling the 
nature of the nuclear interaction and then comparing the results of our quantum me¬ 
chanical calculations with experiment. As we will see in our analysis of the deuteron, 
this approach faces severe limitations. 

The simplest model of the nuclear force between a proton and a neutron is a 
spherical well of finite range a and finite depth F 0 , shown in Fig. 10.7. The potential 
well, which looks square in Fig. 10.7a, is often referred to as a “square” well but 
is really a spherical well in three dimensions, as indicated in Fig. 10.7b. Unlike the 
hydrogen atom with its infinite set of bound states, experiment reveals that there is 


9 For a discussion of the dynamical symmetry associated with the hydrogen atom, sec L. I. 
Schifif. Quantum Mechanics , 3rd ed., McGraw-Hill. New York, 1968. Section 30. We will return 
to the subject of accidental degeneracy at the end of this chapter. 
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(a) 



(b) 


Figure 10.7 (a) A graph of the potential energy of the finite spherical well (10.53 Mb) The 
region of the well shown in three dimensions. 


only a single bound state for the n-p system. All the excited states of the two-nucleon 
system are unbound. 

The ground state of the potential well 


V = 


—V 0 r < a 
0 r > a 


(10.53) 


is an / = 0 state, for which there is no centrifugal barrier. We thus solve the radial 
equation for an / = 0 bound state, one with energy — V 0 < E < 0: 


hr d 2 u 

- — —-V 0 u = Eu r < a 
2p dr- 


h 2 d 2 u 

-- = Eu 

2p dr 2 


r > a 


We can express this equation in the form 


(10.54a) 

(10.54b) 


where 


and 


d 2 u 2u 

— = - —(Vo + £)u = -^i< r<a 

dru 2 p.E 

—- —- —u = q u r > a 

dr 2 H 2 


*° = \/f (V(, + £) 


(10.55a) 


(10.55b) 


(10.56a) 


Q ~ 



(10.56b) 


so that q and k 0 arc both real for the energy in the range — V 0 < E < 0. 
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The solutions to (10.55) are just 

« = A sin k 0 r + B cos k§r r < a (10.57a) 

u = Ce~ qr + De qr r>a (10.57b) 

The boundary condition «(0) = 0 tells us that B = 0. while the requirement that our 
m ilution satisfy the normalization condition (10.5) demands that we set D = 0. Thus 
the lorm for the radial wave function u must be 

u = A sin k 0 r r < a (10.58a) 

u = Ce~ qr r > a (10.58b) 

Since the differential equation (10.54) is a second-order differential equation 
with a finite potential energy, the first derivative of u must be continuous so that 
the second derivative exists and is finite everywhere. Correspondingly, in order for 
the first derivative to be well defined, the function u must be continuous everywhere. 
Satisfying the continuity condition on it at r = a yields 

A sin kfid = Ce~ qa (10.59a) 

while making the derivative of u continuous at /• — a yields 

Ako cos k a a = —qCe~ qu (10.59b) 

Dividing these equations, we find 


tan V = —— (10.60) 

q 

Equation (10.60) is a transcendental equation that determines the allowed values 
of the energy. A convenient aid to determining graphically the energy eigenvalues is 
to inUoduce the variables k () a = f and qa = q. Expressed in terms of these variables, 
(10.60) becomes 


C cot k = -n 


(10.61) 


Note that 


2 2 2/rV'orr 


(10.62) 


which is independent of the energy E. Figure 10.8 shows aplot of (10.61) and (10.62) 
in the fyty plane. From the figure we see that there are no bound stales unless 


2/xV 0 a 


fi 2 


my 


(10.63a) 
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Figure 10.8 A plot of f cot f = — t] and f 2 + rj 2 = 2 fi\' ll a 2 /lr = r 2 
for three values of the radius r 0 , which yield zero, one. and two bound 
states, respectively. 


For 

(|) 1 < ^<(f) 3 

there is a single bound state, and so on. These results are to be contrasted with the 
results from the one-dimensional finite square well, for which there is at least one 
bound state no matter how shallow or narrow the well. Although (10.54) has the same 
form as the energy eigenvalue equation for a one-dimensional well, die boundary 
condition m( 0) = 0 restricts the eigenvalue condition to (10.61), eliminating the 
C tan £ = i] curves that would fill in the "missing space" in Fig. 10.8 in the purely 
one-dimensional system (see Problem 10.9). 

From analysis of the threshold for y+d-Mi + p. the photodisintegration of 
the deuteron, we know that the deuteron bound state has an energy £ = —2.2 MeV. 
However, with a single bound state wc cannot do more than determine the product 
of Vna 2 . Let’s take the value for a to be roughly 1.7 x 10“ 15 cm, as determined 
from scattering experiments, and determine the value for the depth V 0 of the nuclear 
potential well. To get started, assume that | £T | <$C V 0 , that is, the deuteron is just barely 
bound. If this is so, the curves in Fig. 10.8 intersect when 2 /j.V 0 a 2 /fv = (tr/2) 2 . 
Using the experimental value for a, we find V 0 = 35 MeV. in agreement with our 
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V 



(a) (b) 

Figure 10.9 (a) The potential energy of a finite spherical well showing the energy of the 
bound state corresponding to the deuteron relative to the depth of the well, (b) A sketch 
of the radial wave function it(r) for the deuteron. Note that the wave function extends 
significantly beyond the range a of the well. 

assumption that the deuteron binding energy is much smaller than the depth of the 
potential well. In Fig. 10.9a we redraw the potential well and the energy of the 
bound state more to scale. We now have enough information to sketch the function 
u in Fig. 10.9b. The two nucleons have a substantial probability of being separated 
by a distance that is greater than the range of the potential well and the shape of 
the wave function is. correspondingly, not very sensitive to the detailed nature of the 
potential. 

Even though the square well is not an especially realistic model for the inter- 
nuclear potential, it does give us some useful information about the nuclear force. 
However, we now see the problem in trying to understand nuclear physics by study¬ 
ing the two-body bound-state problem. With just a single bound state, we do not have 
enough information to gain a detailed picture of the nuclear interaction from a study 
of the bound-state spectrum. It is worth noting here, however, that this single bound 
state does give us some additional information. The deuteron has an intrinsic spin of 
one; the proton and the neutron do not bind together in a spin-0 state, indicating that 
the nuclear force is spin dependent. The deuteron has a magnetic moment and an 
electric quadrupole moment. The existence of an electric quadrupole moment tells 
us that the probability distribution in the ground state is not spherically symmetric. 
Thus this system is not strictly described by a central potential. However, the depar¬ 
ture from spherical symmetry turns out not to be large. More detailed calculations 
show that the ground state of the deuteron is a mixture of 96 percent l = 0 and 4 per¬ 
cent 1 = 2 states. This mixing is due to a spin-orbit coupling in the nucleon-nucleon 
interaction that we have neglected in our simple model. We will see more evidence 
for this spin-orbit coupling in the next section when we look at a system of many 
nucleons, and we will sec how spin-orbit coupling can arise in atomic systems in 
Chapter 11. 
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10.4 The Infinite Spherical Well 


One way to learn more about nuclear interactions is to study systems containing a 
larger number of nucleons, that is, heavier nuclei. One convenient way to describe 
such a multibody system is in terms of a model in which each nucleon moves 
independently in a potential well due to its interactions with the other nucleons. 
A simple but useful model—known as the Fermi gas model—is to take the potential 
energy of each nucleon to be that of a spherical potential well like that shown in 
Fig. 10.7. As we add more and more nucleons to the well, the size of the well 
increases. Note that the effect of increasing a for the I — 0 solutions to the finite 
potential well is the same as increasing the depth of the well Vq. In either case, 
the radius of the circle formed by f 2 + r / 2 grows, and the intersection points of the 
circle and f cot £ = — 77 approach £ = ntr as the radius approaches infinity. In order 
to determine the energy levels of the / ^ 0 as well as the / = 0 states, we therefore 
examine the infinite potential well 


V = 


0 r < a 
00 r > a 


(10.64) 


This well will give us an idea of the ordering of the energy levels for a spherical 
well, and it is much easier to solve than the finite potential well. 

What is the behavior of the wave function in a system where the potential energy 
jumps discontinuously to infinity? Notice in the explicit form (10.58b) for the / = 0 
wave function in the region outside the finite potential well that as the energy E takes 
on values that are more negative and therefore further below the top of the potential 
well, the exponential tail of the wave function falls off more rapidly. In the limiting 
case that the energy is infinitely far below the top of the well, the wave function 
vanishes in the region r > a. As we saw when we discussed the one-dimensional 
particle in the box, in this limit the derivative of the wave function is discontinuous 
at the boundary, as shown in Fig. 6.10. In fact, in order to obtain a solution to the 
differential equation (10.2) in this limit, we want the derivative to be discontinuous 
so that when we evaluate the second derivative we do get infinity. Otherwise we 
cannot satisfy the differential equation at the point where V jumps discontinuously 
to infinity. See (6.96). 

Because the potential energy inside the well is zero, we are searching for the 
solutions inside the well to the Schrodinger equation for a free particle. However, 
since we must satisfy the boundary condition 1 p(r = a. 0, (p) = 0. we need to find 
solutions in spherical coordinates of the form R(r)Y t m (6, <p ); we cannot just use 
the plane wave solutions (9.25). Rather than start with the radial equation for the 
function w(r), we return to the radial equation (10.2) for R(r) for the case V (r) — 0. 
This equation takes the form 


d 2 R 2 dR_ 
dr 2 r dr 


/(/ + 1 ) 


R+k 2 R = 0 


(10.65) 
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where 



( 10 . 66 ) 


With no potential energy, we can obtain a power-series solution to (10.65) in the form 
.'t a two-term recursion relation. However, we don't need to follow this procedure in 
this case, since (10.65) can be solved in terms of simple functions. If we introduce 
the dimensionless variable p = kr, (10.65) becomes 


^ + !" + r l _w±ifu = o 

dp 2 p dp |_ P 2 . 


(10.67) 


known as the spherical Bessel equation. Solutions to this equation that are regular 
at the origin are called spherical Bessel functions, 

Mp) = (-p) 1 (io.68a) 


while irregular solutions at the origin are called spherical Neumann functions. 

(10.68b) 




The first few functions, shown in Fig. 10.10, are 
sin p 


Mp) = 


. , . sin p cos p 

J\(P) = — - 

P 2 P 


Mp) ■ 


cos p 


, „ cosp sinp 
n\( p) = -^- 


, /3 1\. 3 cosp /3 1\ 

72 (P) = I -?-) sin p- — MP) = - ^-cos p - 

Vp 3 p/ p 2 Vp 3 pJ 


3 sin p 
P 2 

(10.69) 




Figure 10.10 (a) Spherical Bessel functions and (b) spherical Neumann functions. 
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Figure 10.11 The ground-state, first excited, and second excited radial wave functions 
R(r) for the infinite spherical potential well. 


For the spherical well we must choose solutions to (10.67) that satisfy the 
boundary condition that u(r) — rR(r) vanishes when r = 0. Thus we must discard 
the spherical Neumann functions. The energy eigenvalues are then determined by 
requiring that 


ji (ka) = 0 


(10.70) 


Let’s first examine the / — 0 condition, 

jo(ka) = 2E*Z = 0 (10.71) 

ka 


which is satisfied when ka = n r n, where n,. — 1, 2, 3, ... . The I = 0 energies are 
given by 


^n r ,l —0 


n 2 k 2 

2m 


2 n 



n r - 1. 2. 3. . .. 


(10.72) 


which agrees with our analysis of the finite well in the limit that the depth of the well 
approaches infinity. 10 The value of n r specifies the number of nodes in the radial wave 
function, as indicated in Fig. 10.11. The ground state has the only node occurring at 
r = a, the first excited I = 0 state has the second node occurring at r = a. and so on. 

For the higher order spherical Bessel functions we cannot determine the zeros by 
inspection, as we have done for the / = 0 state. However, these zeros are tabulated: 11 



/ = 0 

/ = 1 

1 = 2 

1 = 3 

n r = 1 

3.14 

4.49 

5.76 

6.99 

n r = 2 

6.28 

7.73 

9.09 

10.42 

n r — 3 

9.42 

10.90 

12.32 

13.70 


10 To compare the results, redefine the bottom of the finite potential to be at V =0 and the top 
at V = V 0 and then let V () -*■ oo. 

11 P. M. Morse and H. Feshbaeh, Methods of Theoretical Physics, McGraw-Hill, New York, 
1953, p.1576. 
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Figure 10.12 The energy levels of the infinite spherical well. 


Notice that the lowest zero ol' an / = 1 energy state occurs when ka = 4.49, and thus 
the energy is given by 


h 2 k 2 _ h 2 ( 4.49 \ 2 
" r=U=1 “ “ 2n { a ) 


(10.73) 


which is therefore intermediate between the 1 = 0 ground state [n r = 1 in (10.72)] 
and the first excited / = 0 state | n,. = 2 in (10.72)]. Figure 10.12 shows the energy 
spectrum for the infinite spherical well. Note the absence of any accidental degen¬ 
eracy. 

Let’s now try making a nucleus by filling the energy levels with protons and 
neutrons. Since protons and neutrons are both spin-4 particles, we will find in 
Chapter 12 that we can put no more than two protons and two neutrons in each 
of these energy states. If we neglect the Coulomb repulsion between the protons as 
a first approximation, the energy levels for protons and neutrons are the same. If 
we fill the levels with protons, for example, we can put two protons in the n r — 1, 
/ — 0 ground state. The next level is an n r = 1, / = 1 state into which we can put 
six protons, since there are three different m values for / = 1. The next energy 
level is an n r = 1, / = 2 state, which can fit 2 x 5 = 10 protons, since there are 
five different m values for / — 2. After we have filled this energy level, the next 
energy level is n r = 2,1 = 0, which can again accommodate two protons. In this 
way, we see that the energy levels will be completely filled when the number of 
protons is 2, 8 (= 2 + 6), 18 (= 8 + 10), 20 (= 18 + 2), 34 (= 20 + 14), 40, 
58,..., with a similar sequence for neutrons. Real nuclei exhibit special properties 
that are associated with filled energy levels, or closed shells, with the “magic” 
numbers 2. 8. 20, 28, 50. 82, and 126. The differences between the observed magic 
numbers and those in our very' simple model arise because, as for the deuteron, 
there is a strong “inverted" spin-orbit coupling that shifts the energy levels (see 
Fig. 10.13). 
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Figure 10.13 The ordering of the energy levels in a variety of potential energy wells. 
Adapted from B. T. Feld, Ann. Rev. Nuclear Sci. 2, 239 (1953), as reproduced by R. B. 
Leighton, Principles of Modern Physics, McGraw-Hill, New York, 1959. 


10.5 The Three-Dimensional Isotropic Harmonic Oscillator 


As our last example of analytically determining the energy eigenvalues of a central 
potential, we consider the three-dimensional simple harmonic oscillator, for which 
the potential energy is 


V (r) = ^ixco 2 r 2 — ^ H(jl> 2 {x 2 + y 2 + z 2 ) 


(10.74) 
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Such an oscillator is often referred to as an isotropic oscillator, since the “spring 
constant 7 ’ has the same magnitude in all directions. 12 One of the things that makes 
this isotropic oscillator especially noteworthy is that we can easily determine the 
eigenstates and eigenvalues in both spherical and Cartesian coordinates and then see 
the connection between two different sets of basis states. 


CARTESIAN COORDINATES 

The Hamiltonian for the three-dimensional isotropic oscillator is given by 


H - 


1 


+ ^/raT|r| 2 = 


2/i 


Px + Py + PI , 1 


+ -fi( 0 2 (x 2 + y 2 + i 2 ) (10.75) 




which in the last step has been expressed in Cartesian coordinates. This Hamiltonian 
is a sum of three independent one-dimensional harmonic oscillators: 


H = H X + H y + H z (10.76) 

where 

p2 . 

H x = — + -narx 2 (10.77a) 

2/i 2 

-2 

H v = + -n(o 2 y 2 (10.77b) 

2 ii 2 

= ^ + -iiorz 2 (10.77c) 

2/x 2 

Since these Hamiltonians commute with each other—f H x . W v J = 0, and so on—they 
have simultaneous eigenstates in common. 


IE) = | E y , E z ) (10.78) 

where 

H\E) - (H x + H y + H Z )\E X , E y , E z ) = (E x + E y + E Z )\E X , E y , E z ) (10.79) 

and hence E — E x + £ v + £.. Here we can take full advantage of the eigenstates 
of the one-dimensional harmonic oscillator that we found in Chapter 7. Namely, we 
can specify the eigenstates with the three integers 


\E) = | n x , n y , n z ) n x , n y , n z = 0, 1, 2,... 


(10.80) 


12 An anisotropic oscillator would have a potential energy of the form 
V = ^/Li( arx 2 + tu 2 y 2 + co\z 2 ) 
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where 


E — ^ hco + ^ j + ( n z + \ j ftco 

= (n x + n y + n z + Hco n x , n y , n z — 0. 1, 2, . .. (10.81) 

Setting n x + n Y + n z — n, we can express the total energy as 


E n = + 0 Hco n = 0, 1, 2, ... (10.82) 

We can write the energy eigenfunctions (x, y, z\n x , n v , « z ) as a product of three 
one-dimensional eigenfunctions that we determined in Chapter 7. It is instructive, 
however, to see how these eigenfunctions arise by solving the three-dimensional 
Schrodinger equation directly in position space, because this provides a good illus¬ 
tration of the technique of separation of variables that we have alluded to several 
times. We write the energy eigenfunction as 


(x, y, z\E) = X (x)K (y)Z(z) (10.83) 


and substitute it into the position-space energy eigenvalue equation: 


[-IG 


“TI liT + T5 + iT j + i"" 2 '* 2 + ,2 + z2) 

= EX(x)Y(y)Z(z) 


X(x)T(y)Z(z) 

(10.84) 


If we then divide this equation by the wave function X(x)Y(y)Ziz ). we obtain 


1 h 2 d 2 X 
X 2/r dx 2 


, 1 2 2 
+ - }LCO X 


Y 2/x dy 2 + 


1 2 7 
y 


+ 


\_ir_d^Z 

Z If! dz 2 


1 2 2 
-gw Z 


- E 


(10.85) 


This separation-of-variables approach (10.83) “works,” since the partial differential 
equation (10.84) can now be expressed as the sum of three independent pieces: the 
term in the first bracket in (10.85) is solely a function of .v, the second bracket is 
solely a function of y, and the third bracket is solely a function of z. Now x, y, and 
z are independent variables, and hence each of the functions in die brackets can be 
varied independently. Thus the only way for this equation to hold for all x, v, and z is 
for each of the terms in the brackets to be equal to a constant. With some foresight we 
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call the constants E x , £ v , and E : . Then (10.85) breaks into three separate equations: 

-—^ + -puo 2 x 2 X = E x X (10.86a) 

2/Lt dx 2 2 

12 y i 

-— + -hco 2 \ 2 Y = E X Y (10.86b) 

2H dy 2 2 

t- 2 j2 ^ 1 

— '-— + -Hco 2 z 2 Z = E z Z (10.86c) 

2/i dz 2 2 

where £,. + £ v + £. = £. Each of the equations (10.86) is an energy eigenvalue 
equation for a one-dimensional harmonic oscillator with the eigenfunctions 

X„ x (x) = (x\n x ) Y n (y) = (y|/i v ) and Z„ z (z) = (z\n 2 ) (10.87) 

where we have used the same bra-ket notation for each independent oscillator that 
we used in Chapter 7. The energy eigenvalues (10.81) and the energy eigenstates 
then follow directly from those results. 


SPHERICAL COORDINATES 


We next take advantage of the spherical symmetry of the Hamiltonian (10.75) to 
write the energy eigenfunction as 


{r| £) = (r, 0, HE) = R(r)Y, m (8, <f>) = — Y Lm (6, 0) 

r 

The radial equation is then given by 

fr d 2 u l (l + 1 )h 2 1 -> -> _ 

-- -|-- —u -I- -uarr u = Eu 

2/r dr 2 2fj,r 2 2 

Expressed in terms of the dimensionless variables 

IJuo 2 E 

p— — r and /. = — 

V h tuo 

the differential equation (10.89) becomes 

d 2 u /(/+ 1 ) 2 

--- - - U — p~U = —A II 

dp 2 P 2 


( 10 . 88 ) 


(10.89) 


(10.90) 


(10.91) 


We can see that attempting a power-series solution to (10.91) will meet with a three- 
term recursion relation. However, for p -» oo, the differential equation becomes 


d 2 u -> 

~7~2 ~ P u 
dp 2 

This suggests we search for a solution of the form 

« = p' + 'e~ p2/2 f(p) 


(10.92) 


(10.93) 
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where the first factor indicates the known behavior for small p for a spherically 
symmetric potential and the exponential indicates the asymptotic behavior for large 
p. We can then find a two-term recursion relation for the power series 

OC 

f{p) = £ c k p k (10.94) 

k =0 

It is straightforward to show that unless this power series terminates, it has the 

2 

behavior of e p for large p (see Problem 10.12). The energy quantization condition 
resulting from requiring termination of the power series is 

E-(ln r + l + ^jtw) n r = 0,1,2,... (10.95) 

where n r is the number of the nodes of the function f(p). Defining the principal 
quantum number n — 2n r + /, we obtain 

E n — hco n = 2n r +1 n — 0. 1. 2, .. . (10.96) 

in agreement with our earlier result. These energy levels are indicated in Fig. 10.14. 

DEGENERACY 

As with the hydrogen atom, one of the surprising features of the energy eigenvalues 
of the harmonic oscillator is the high degree of degeneracy. We can see this in both 
approaches to the oscillator. In Cartesian coordinates there are different combina¬ 
tions of n x , n y , and n z in (10.81) that all yield the same energy, while in spherical 
coordinates, for a particular value of n, the states with /=«.«— 2, ... ,1 or 0 all 
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Figure 10.14 The energy levels E n = (n + ^)hco of the 
isotropic harmonic oscillator, showing the degeneracy. 
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have the same energy [see (10.95)]. This degeneracy is illustrated for the first three 
energy slates below: 


Cartesian coordinates 


Spherical coordinates 


n= 0 

«v 

= 0 

It y = 0 

n z = 0 

1 state 

n =0 

1 = 0 m = 

0 


"x 

= 1 

n y = 0 

n z = 0 





= 1 

n = 1 

n.v 

= 0 

fly = 1 

n z = 0 

3 states 

n = 1 

1= 1 

» 

= 0 


n x 

= 0 

Ity = 0 

n. = 1 




» 

= -1 


\ n x 

— 2 

fly = 0 

n, = 0 



f/ = 0 


111 = 


n = 2 


n x = 0 n y = 2 n. — 0 
n x = 0 n y = 0 n.= 2 
n x — 1 n v = 1 n, = 0 
n x — 1 ii Y = 0 n, = 1 
n x — 0 n y = 1 n z — 1 


6 states n ■-* 2 


1 = 2 


in = 2 
m = 1 
m = 0 
in = — 1 
m = —2 


If we look at the position-space wave function for the ground state, we see that 


(x, y, z\n x = 0. n y = 0, n z = 0) = X 0 (x)Y 0 (y)Z 0 (z) 

( \ 3/4 

] e -iuox 2 /2h e -iuoy 2 /2n e -iMt 1 /2h 

Jtnj 

= (WL V /4 e -««r 2 /2fl 
\nh) 


(10.97) 


where we have used the form for the wave function (7.43b) for X 0 (.r) and the 
corresponding expressions for Y Q (y) and Z 0 (z). 13 Notice that in the last step we 
have gone from an energy eigenfunction expressed in Cartesian coordinates to one 
expressed in spherical coordinates. The lack of angular dependence tells us that this 
is indeed a state with 1 = 0. However, if we take one of the three n = 1 eigenfunctions 
in Cartesian coordinates, 


(,v, y, z\n x = I, iiy = 0. n, = 0) = X x (x)Y 0 (y)Z 0 (z) 



(10.98) 


We have replaced the mass m in the one-dimensional harmonic oscillator wave functions 
'a ith the reduced mass /i in accord with (10.84). 
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(a) (b) 

Figure 10.15 (a) The classical orbits of a particle moving 
in a pure Coulomb or isotropic oscillator central potential 
close on themselves, (b) The classical orbits for other 
central potentials do not close and the orbit precesses. 

i 

we can recognize the angular dependence as a linear combination of Yy j and Yy _ lf 
showing that the n — 1 states do have / = 1. 

The high degree of degeneracy for the isotropic harmonic oscillator is reminis¬ 
cent of that for the hydrogen atom. Here too there is a “hidden” symmetry that is 
responsible. 14 In Chapter 9 we saw that symmetries lead to conservation laws, and so 
it is natural to ask what is conserved in these two central-force systems in addition to 
orbital angular momentum. Classically, conservation of orbital angular momentum 
means the orbital angular momentum points in a fixed direction. Consequently, the 
classical orbit must reside in a plane. In addition, the l/r and r 2 central potentials 
share an unusual feature in classical mechanics: they are the only ones for which 
the orbits close upon themselves and do not precess (see Fig. 10.15). Thus within 
the plane of the orbit there is an additional constant of the motion for these two 
potentials—a vector pointing from the apogee to the perigee of the orbit maintains 
its orientation in space. 


10.6 Conclusion 


In this chapter w’e have examined almost all of the energy eigenvalue problems for a 
central potential that have exact solutions. In the case of the isotropic oscillator, we 
can solve the eigenvalue equation in two different coordinate systems. Surprisingly, 
the 1 /r potential can also be solved in two different coordinate systems, parabolic as 
well as spherical. There is a certain irony in this because there are so few problems 
we can solve exactly, and we can solve each of these two in two different w'ays. 
Nonetheless, we should be grateful that we can solve these particular problems at 


14 See the discussion of accidental degeneracies in R. Shankar. Principles of Quantum Me¬ 
chanics, Plenum. New York. 1980. 
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all. After all, the solutions to the Coulomb potential form the foundation for our 
analysis of the hydrogen atom, which continues in Chapter 11. 

Problems 


10.1. The position-space representation of the radial component of the momentum 
operator is given by 



Show thatforitsexpectation value to be real: {\j/\p r \^r) — (tj/\p r \tj/)*, the radial wave 
function must satisfy the condition u(0) = 0. Suggestion: Express the expectation 
value in position space in spherical coordinates and integrate by parts. 


10.2. An electron in the Coulomb field of the proton is in the state 

|^> = ±|1,0, 0) + |[2. 1. 1) 

where | n, /, m) are the usual energy eigenstates of hydrogen. 

(a) What is (£) for this state? What are (L 2 ) and {£,)? 

(b) What is \xj/ (r)>? Which of the expectation values in (a) vary with time? 


10.3. A negatively charged pion (a spin-0 particle) is bound to a proton forming a 
pionic hydrogen atom. At time t = 0 the system is in the state 

|^) = ||1, 0, 0> + -J=|2, 1, 1> + ^|2, 1,0) 


In an external magnetic field in the z direction, the Hamiltonian is given by 

- p 2 e~ f 
H ~ Z 777 + WyL. 

2/x |r| 


where p is the reduced mass of the pion-proton system. 

(a) What is |i/r(f)), the state of the system at time r? What is {£) at lime t for this 
state? 

(b) What are (L x ) and (L.) at time t for this state? 


10.4. Calculate the probability that an electron in the ground state of hydrogen is 
outside the classically allowed region. 

10.5. What is the ground-state energy and Bohr radius for each of the following 
two-particle systems? 

(a) 2 H, a bound state of a deuteron and an electron 
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(b) Positronium 

(c) A bound state of a proton and a negative muon 

(d) A gravitational bound state of two neutrons 

What is the wavelength of the radiation emitted in the transition from the n = 2 state 
to the n = 1 state in each case? In what portion of the electromagnetic spectrum does 
this radiation reside? 

10.6. Use the power-series solution of the hydrogen atom to determine m 3 . 0 ( 1 °) • 
Ignore normalization. Compare your answer with (10.45a). 

10.7. An electron is in the ground state of tritium, for which the nucleus is the isotope 
of hydrogen with one proton and two neutrons. A nuclear reaction instantaneously 
changes the nucleus into ? He, which consists of two protons and one neutron. 
Calculate the probability that the electron remains in the ground state of the new 
atom. Obtain a numerical answer. 


10.8. Show that there are no allowed energies E <—V 0 for the potential well 

— Vo r < a 
V = 0 

0 r > a 

by explicitly solving the Schrodinger equation and attempting to satisfy all the 
appropriate boundary conditions. 


10.9. Use the techniques illustrated in Section 10.3 to solve the one-dimensional 
potential well 


V(jc) = 


— Vo \x\<a 
0 |x| > a 


Show that there always exists at least one bound state for this well. 


10.10. Determine the ground-state energy of a particle of mass p in the cubit 
potential well 


V( Xi ) = 


0 0 < Xj < a 

oo elsewhere 


x, = x, y, z 


Compare the volume of this infinite well with the spherical one (10.64) and discuss 
in general terms whether the relative values of the ground-state energies for the two 
wells are consistent with the position-momentum uncertainty relation. 


10.11. A particle of mass n is in the cylindrical potential well 


V(p) = 


» = J* 2 + V 2 


0 p < a 
oo p > a 
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(a) Determine the three lowest energy eigenvalues for states that also have p z and 
L z equal to zero. 

(b) Determine the three lowest energy eigenvalues for states with p z equal to zero. 
The states may have nonzero L z . 

Suggestion: Work out Problem 9.23 before attempting this problem. Watch out 
for the appearance of Bessel's equation and ordinary Bessel functions when solving 
the radial equation. 

10 . 12 . 

(a) Substitute the expression (10.93) for the radial wave function of the three- 
dimensional isotropic oscillator into (10.91) to determine the differential 
equation that f{p) obeys. 

(b) Obtain a two-term recursion relation for the power series (10.94); show 
that this power series must terminate and that the energy eigenvalues of the 
oscillator are given by (10.95). 

10.13. Expectation values are constant in time in an energy eigenstate. Hence 

^-^ = i(£|[ff,f-pj|£)=0 

dt h 

Use this result to show for the Hamiltonian 

*2 

= Jp + V(|f|) 


<A H£H <r ^ 

which can be considered a quantum statement of the virial theorem. 


10.14. 

(a) Calculate ( V) for the ground state of hydrogen. Show that E = (V)/ 2. What 
is ( K), the expectation value of the kinetic energy, for the ground state? 
Show that these expectation values obey the virial theorem from classical 
mechanics. 

(b) Calculate ( V ) for the ground state of the isotropic three-dimensional harmonic 
oscillator. How are (K) and (V) related for the oscillator? What do you expect 
based on the virial theorem? Explain. 

10.15. Suppose that nucleons within the nucleus are presumed to move indepen¬ 
dently in a potential energy well in the form of an isotropic harmonic oscillator. 
What are the first five nuclear "magic numbers” within such a model? 
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10.16. The potential energy in a particular anisotropic harmonic oscillator with 
cylindrical symmetry is given by 

V = ^n[a>*(x 2 + y 2 ) + W3Z 2 ] 

with to) < cot, < 2o) v 

(a) Determine the energy eigenvalues and the degeneracies of the three lowest 
energy levels by using Cartesian coordinates. 

(b) Solve the energy eigenvalue equation in cylindrical coordinates and check 
your results in comparison with those of (a). 


10.17. Consider the Hamiltonian for the two-dimensional motion of a particle of 
mass ix in a harmonic oscillator potential: 


it Px , 1 2-2 Py ,1 2-2 

H = — + -ixco x + — + -lid) y 
2n 2 2fi 2 


(a) Show that the energy eigenvalues are given by E„ = (n + 1 )tuo, where the 
integer/! =n l -f n 2 , with n |, n 2 = 0, 1. 2 ,... 

(b) Express the operator L z = xp > — yp x in terms of the low ering operators 





(* + —Px) 

\ ) 


and 



and the corresponding raising operators a\ andai. Give a symmetry argument 
showing that [H, Z..] = 0. Evaluate this commutator directly and confirm that 
it indeed vanishes. 

(c) Determine the correct linear combination of the energy eigenstates w ith en¬ 
ergy £j = 2 Hco that are eigenstates of L z by diagonalizing the matrix repre¬ 
sentation of L- restricted to this subspace of states. 


10.18. The spherically symmetric potential energy of a particle of mass // is given by 


V(r) = 


0 a < r < b 
00 elsewhere 


where r = ^Jx 1 + y 2 + z 1 . 

(a) Determine the ground-state energy. 

(b) What is the ground-state position-space eigenfunction up to an overall nor¬ 
malization constant? What condition would you impose to determine this 
constant? 

(c) What is the energy of the first excited / = 0 state? Explain why it would not 
be so straightforward to determine the energy of the / = 1 states. 
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10.19. The Hamiltonian for two spin-j particles, one with mass m x and the oth< 
with mass m 2 , is given by 


H = 


i 


^ 2 

p; 


+^+v a m+l- A 


2m 1 2m 


S, S 2 

fi 2 


v fc (|f|) 


where r = r x — r 2 and 


V a (r) = 


0 

V 0 


r < a 
r > a 


V b (r) = 


0 r <b 
Vo r>b 


with b < a and V 0 very large and positive. 

(a) Determine the normalized position-space energy eigenfunction for the groun 
state. What is the spin state of the ground state? What is the degeneracy 
Note: Take V 0 to be infinite where appropriate to make the calculation t 
straightforward as possible. 

(b) What can you say about the energy and spin state of the first excited state 
Does your result depend on how much larger a is than hi Explain. 


Page 397 (metric system) 


CHAPTER 11 


Time-Independent Perturbations 




Obtaining quantitative agreement between theory and experiment in the real world 
has its ups and downs. The bad news is that there aren’t any interacting systems 
that have Hamiltonians for which we can determine the energy eigenvalues and 
eigenstates exactly. The good news is that because a number of extremely important 
physical systems are sufficiently close to ones that we can solve, such as the harmonic 
oscillator and the hydrogen atom, we can treat the differences as perturbations and 
deal with them in a systematic way. In the beginning of this chapter we wall focus 
on the effect of an external electric field on a number of familiar systems—the 
ammonia molecule treated as a two-state system, the one-dimensional harmonic 
oscillator, and the hydrogen atom. We will then consider the effect of internal 
relativistic perturbations in the hydrogen atom, leading to the fine structure. We will 
also investigate the effect on the hydrogen atom of an external magnetic field, the 
Zeeman effect. 

11.1 Nondegenerate Perturbation Theory 


We begin by expressing the Hamiltonian for some system in the form 

H = Hq + Hi ( 11 . 1 ) 

where the part of the Hamiltonian that is presumed to be "big” is H 0 . often called 
the unperturbed Hamiltonian, and Hi is the “small” part, often referred to as the 
perturbing Hamiltonian. For a perturbative approach to work, we must be able to 
determine the eigenstates and eigenvalues of H 0 : 

H 0 \<p«») = |*>f) (11.2) 

where |^ 0) ) = |£^ 0) > is the eigenstate with energy E®\ Of course, we are presuming 

381 
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that v\ c are not able to determine the energy eigenstates and eigenvalues of the full 
Hamiltonian: 


m n )=EM (n.3) 

so we will attempt a solution of (11.3) in the form of a perturbative expansion. 

In order to keep track of the order of smallness in our perturbative expansion, it 
is convenient to introduce a parameter k into the Hamiltonian: 

H^Ho + kHi (11.4) 

Thus by adjusting the value of k, we can adjust the Hamiltonian. In particular, as 
k —> 0 and we turn off the perturbation, H /7 n , while as k —* I, H -*■ // 0 + H h the 
full Hamiltonian for the system. 1 We assume that we can express the exact eigenstates 
and eigenvalues as a power-series expansion in k: 


\fn) = WT) + MV?) + * 2 ! <P?) + ■ ■ ■ ( 11 - 5 ) 

E„ = £< 0) + aE'" + A 2 E« 2) + • • • (11.6) 

If this perturbative expansion is to be useful, successive terms in the series must 
grow progressively smaller, and we can then obtain a reasonable approximation to 
the full energy eigenvalue equation by retaining just the first few terms. In particular, 
note that we are presuming that as k —► 0, E n —*■ E^ 0) and \\jr„) —► |^ 0) ) smoothly, 
as indicated in Fig. 11.1. 

As an example illustrating how a series expansion such as (11.6) might arise, 
let’s first reexamine the two-state system of the ammonia molecule in an external 
electric field, which we analyzed in Section 4.5. There we noted that the matrix 
representation of the Hamiltonian can be expressed as 

£_/<l|tf|l) <l|tf|2)\ = /£ 0 + /t e |E| -A \ 

A(2|tf|l) (2\H\2) ) V -A E 0 -n e \E\) 

which has the exact eigenvalues 

£ = E 0 ± v /(m,|E|) 2 + A2 (11.8) 


1 Some authors prefer to consider k as part of the real Hamiltonian, rather than just a parameter 
that is introduced to help keep track of smallness. The problem with this alternative approach is 
that it is sometimes difficult to see at the start a natural small dimensionless parameter in the system 
that can play the role of k. 
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E( A) 



Figure 11.1 A schematic diagram showing how 
the energy levels of the Hamiltonian (11.4) might 
change as X varies between 0 and 1. 


For external electric fields that satisfy ^t g |E| <<C A, we can expand the square root to 
obtain the following power series for the energy: 


E = E 0 ±A±A 


I lElj 2 l^lElj 4 



(11.9) 


Notice that as /z c |E| -» 0. the energies go smoothly into the energies of the molecule 
in the absence of the electric field, namely, Ef" = Eq — A. which has the cor¬ 
responding eigenstate |/) = (l/\/2)(|l) 4- |2}) that we found in Section 4.5, and 
Ef} ] = £o + A, which has the corresponding eigenstate |//) = (l/\/2)(|I) — |2>). 
The exact eigenstates of the Hamiltonian (11.7) can also be expressed as a power 
series in the small quantity fi e \E\/A, with the zeroth-order terms given by |7) and 
| II) (see Problem 11.5). 2 

Let’s return to the general problem of determining the expansions (11.5) and 
(11.6) when we are not able to determine the eigenstates and eigenvalues exactly. 
Substituting (11.5) and (11.6) into the energy eigenvalue equation (11.3). we obtain 

(w 0 +(i*r>++x 2 i^ 2, >+• • •) 

= (f' 0) + kE™ + A 2 E' 2) + • • ■) (|^ 0) > + X|^>) + X 2 |^ 2) > + • • ■) (11.10) 

Since k is an arbitrary parameter, for (11.10) to hold, the coefficients of each power 
of k must separately satisfy the equation. The terms that are independent of k , the 
k° terms, are just (11.2), or 


tfoK 01 )= 


(11.11a) 


2 We will continue our discussion of the ammonia molecule in an external electric field in 
Section 11.4. 
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The a terms yield 

«oi^ u )+ »i\<p?) =(n.i ib> 
while the X 2 terms yield 

W 0 |^ 2 >) + H x \<p ( ") = E<°V< 2) > + £< ,) |^ 1) > + E™\<pf ] ) (11.1 lc) 

and so on. You can see the pattern that arises if we were to go on to consider higher 
order terms. 


THE FIRST-ORDER ENERGY SHIFT 

A useful procedure for extracting the information contained in these equations 
is to take the inner product with the complete set‘of basis bras We start 

with (11.1 lb) and take the inner product with (<p* 0) | to obtain 

+ <^ 0) itfii^ 0) > = £;°v ( v , , l) >+^Vfi^ 0) > (ii i2) 

Since 

(ii-i3) 

and we are presuming that 

rfV 0) > = 5 *» (H-14) 

(11.12) becomes 

£( n l) = <^ 0) |",l^ 0) > (11-15) 

The first-order shift in the energy is simply the expectation value of the perturbing 
Hamiltonian in the unperturbed state corresponding to that energy. 


THE FIRST-ORDER CORRECTION TO THE ENERGY EIGENSTATE 

Taking the inner product of (11.1 lb) with {<pf ] \ for k ^ n, we find 


or 


,o, _ <W°'l«.l^ 0 ’) 

[(fi k 'n ) ~ r .(0) r-IO) 


F (0) _ r-(0) 

r.„ i* k 


k n 


If we use the basis slates |^ Ul ) to express |^ ,) ) as 


(11.16) 


(11.17) 



then (11.17) tells us how much of l^ 1 ’) lies along each of the |^{ 0) ) fork ^ n. 
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What about (<^ 0) |^)? We return to (11.5) and require that 

1 = (fnlfn) = (‘P i ° > \<Pn ) ) + d- (11-19) 

Since (<p‘ 0) i^, ( ,° l ) = 1- through first order in A we must have 

— ia a real (11.20) 

and therefore 


Wn) = l^ 0) > + /<2A|^ 0) > + A^ l</ 3 f , )(<pfVl 1) > + 0( A 2 ) 

k^n 

= I^X^I^) + O(A^), (11.21) 

k^n 

where in the last step we have taken advantage of the fact that e iaX = 1 + ia), + 
0{ A 2 ). Even after | ifr n ) is normalized, its phase can be chosen arbitrarily; thus it is 
convenient to require that a = 0, or 


(f®IV®)=0 


(1 1 . 22 ) 


so that, to this order in A, \ty n ) and \(p^) have the same phase. This is a natural 
choice, since then the first-order correction \<p^) is orthogonal to the state |^ 0) ) 
and the perturbative correction generates the state 1i//„) which, to this order, “points” 
in a slightly different direction, as depicted in Fig. 11.2. Thus 


ky^n 


(Oh (<Pt 
k > 


(0)iu 
k 


H x \ ^ 0) > 


£, ( , 0) - £ 


( 0 ) 


+ 0( A 2 ) 


(11.23) 


THE SECOND-ORDER ENERGY SHIFT 

Let’s go on to determine E®. We take the inner product of (11.11c) with the bra 

<<a ; 

<^ 0) itfoip‘ 2) > + <^ 0) i^ii^ 1} > - w™) + e/^^v^) + ^ 2 V 0) wT) 

(11.24) 



Figure 11.2 A pictorial representation of first-order non¬ 
degenerate perturbation theory using ordinary vectors. For 
perturbation theory to be effective, the “angle” between \<p^) 
and | Vhi) must be small. Remember that ket vectors are vec¬ 
tors in a complex vector space, so this picture with real angles 
should not be taken literally. 
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Taking advantage of (11.13), (11.17), and (11.22), we obtain 

k^n h k 

where in the last step we have used 

<^ 0) |£il^ 0) ) = 

since is Hermitian. 

Note that to calculate the first-order shift in the energy in (11.15 ), all we need is the 
zeroth-order state. Similarly, in order to calculate the second-order shift in the energy 
in (11.25), all we need to know is the first-order correction to the state. In general, 
calculating the energy to order s requires knowledge of the state to order s — 1. 
Although we could go on to determine higher order corrections, we will find (11.15) 
and (11.25) adequate for our purposes. 


= £ 




nTO)_ 

k^n c 'i c k 


(11.25) 




(11.26) 


EXAMPLE 11.1 Before turning our attention to fully three-dimensional sys¬ 
tems, let’s apply the results of Section 11.1 to our favorite one-dimensional 
system, the simple harmonic oscillator. We suppose that a particle with 
charge q is in a harmonic oscillator potential and that we perturb the system 
by applying a constant electric field E that points in the positive x direction. 
Since there is a constant force q E exerted on the particle, the additional con¬ 
tribution to the potential energy, which is the extra work that we must do to 
displace the particle by a distance x from the origin, is —q\E\x. Therefore 
the Hamiltonian of the system is given by 

*2 

tj T’v 1 2-2 .pi* 

H ~ -f- -mm x — <?|E|x 

2m 2 

We break up the Hamiltonian into two parts: 


Px 


. 2-2 


H 0 = — + -meofx 
2m 2 

Hi = -q\E\x 

Determine the corrections to the unperturbed energies through second order. 3 


3 Note that we can express H\ as the usual electric dipole interaction Hamiltonian 


H } = • K 
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SOLUTION The energy eigenvalues of the unperturbed Hamiltonian are 
given by 

E n 0) = (» + b h0 > 

There are a number of easy ways to evaluate the first-order shift in the energy. 
Using 


x = 


h 


2mco 


(a + a’) 


from Chapter 7. we find that 

= -?|E|/ 


2m co 


(n|(a + a^)|n) = 0 


It is also instructive to evaluate the expectation value in position space: 

/ CO 

dx |(x|n)| 2 .v = 0 

•oo 


Here the integral vanishes because the energy eigenfunction [x j/;) is an even 
or odd function (see Section 7.10), and hence |{x|/t)| 2 is always even and 
x\{x\n) | 2 alw'ays odd. Since the first-order shift in the energy is just propor¬ 
tional to the expectation value of the electric dipole moment operator qx , 
the vanishing of this first-order correction can be ascribed to the absence of 
a permanent electric dipole moment that can interact with the applied electric 
field. 

The second-order shift in the energy is given by 

£(2) = y- l(ft|gilrc )! 2 
" j“ (n + \)hco - (k + j)hco 

Since 


(&|W,|n) - —*/|E| 



l(it|/i 


1> + \fn{k\n - 1» 


where the electric dipole moment operator jl L , = qx. There is nothing wrong with introducing the 
electric dipole moment 

ft, = 1] ?,-r, 

i 

of a system of charges that has a net charge q. However, in this case the dipole moment depends on 
where you locate the origin of your coordinates. If you prefer to deal with a neutral system for the 
one-dimensional harmonic oscillator, you can add a charge —q on a heavy mass that effectively 
resides at the origin. 
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there are contributions to the sum for the second-order shift in the energy 
when k = n + I and k = n — 1. Thus 

£( 2 ) _ <rlE| 2 ft / n + 1 + _n_\ _ <? 2 |E| 2 

" 2 m to V — hco fico) 2 mco 2 

What is the physical source of this nonvanishing higher order contribution? 
On average, the electric field causes the particle to be displaced from the 
origin, inducing a dipole moment proportional to the magnitude of the 
electric field. This induced dipole moment itself interacts with the applied 
field, giving a contribution to the energy that is proportional to the magnitude 
of the field squared. 

In this particular problem we have a simple way to confirm the results. We 
really didn't need to use perturbation theory for the full Hamiltonian because 
we can determine the eigenvalues and eigenstates exactly by “completing the 
square”: 


H = + -mco 2 x 2 — <jr|E|x 

2m 2 

2 in 2 \ mco 2 / 2 mco 2 

Figure 11.3 shows a graph of the potential energy for this Hamiltonian. It is a 
pure harmonic oscillator potential, just shifted along the x axis by q\E\/mar 
and shifted down in energy by q 2 \E\ 2 /2mor. In order to solve formally 
the quantum mechanical energy eigenvalue equation, we define the shifted 
position operator 


.v, = x — 


q |E| 


mco- 


which satisfies the usual commutation relation f.i v , p x ] = ih with the mo¬ 
mentum operator p x . Thus the exact eigenvalues of the Hamiltonian 


are given by 


H = & + -mco 2 x 2 - ‘ rlEi " 


2m 


2 mco 2 


" V 2 ) 2 mco 2 


in agreement with our earlier perturbative results. The exact eigenstates are 
those of the usual harmonic oscillator, only shifted in position by q\E\fmor. 
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V 



Figure 11.3 Graphs of the potential energy V(x ) = imto : x 2 of 
the harmonic oscillator (dashed line) and the potential energy 


V(x) 


Inuu 2 


(- 


<m \ 2 _ q 2 i £ i~ 


mo> 


2 / 2 mar 


of the oscillator in ati external electric field (solid line). 


These eigenstates can thus be expressed in terms of the translation oper¬ 
ator by 

\yjr n ) = f(q\E\/mco 2 )\n) = e - iqm ^ ,ma?, '\n) 

You can verify that this exact expression for the eigenstates agrees with the 
perturbative expansion (11.23) (see Problem 11.2). 


11.2 Degenerate Perturbation Theory 


If we try' to apply the formalism of perturbation theory when there is degeneracy, 
we face a crisis. In particular, the first-order correction to the eigenstate and. conse¬ 
quently, the second-order shift in the energy involve the quantity 


<«! 0 i«.k 0l > 

r-(O) _ r-(O) 


(11.27) 


which diverges if there exist states other than |<p^ 0) ) with energy /T 0 ’. that is, if there 
is degeneracy. In our earlier derivation we assumed that each unperturbed eigenstate 
|^7 0) ) turns smoothly into the exact eigenstate as w'e turn on the perturbing 
Hamiltonian. However, if there are N states 



i = 1. 2. N 


(11.28) 
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(b) 


Figure 11.4 (a) The problem with degenerate perturbation theory for two-fold 
degeneraey. Neither the vector nor |^°i> “points” sufficiently close to 
the exact vectors \ijr n |> or |i/r n2 ). (b) Degenerate perturbation theory selects 
the “right" linear combinations of states so that the perturbative correction is 
small. Remember that ket vectors are vectors in a complex vector space, so 
these pictures with real angles should not be taken literally. 


all with the same energy, it isn’t clear which are the right linear combinations of the 
unperturbed states that become the exact eigenstates. For example, in the case of 
two-fold degeneracy, is it 

and i*® 

or 

or some other of the infinite number of linear combinations that we can construct 
from these two states? If we choose the wrong linear combination of unperturbed 
states as a starting point, even the small change in the Hamiltonian generated by 
turning on the perturbation with an infinitesimal k must produce a large change in 
the state. See Fig. 11.4. 

In order to determine appropriate linear combinations of unperturbed states, we 
return to our expansion (11.5). Allowing for degeneracy, we write 4 

N 

I *«) = £ c,>2> + + ■ • • (1129) 

i=t 

If we substitute this expression for the state into the eigenvalue equation (11.3), 
instead of (11.11 b) we obtain 


"oiv.Vb + «.E c >S>= eIV’) + K°T, 


(11.30) 


i=t 


i = l 


4 Strictly, there are N different first-order corrections for the N different IV',,). We have 
suppressed an extra subscript in labeling these states for notational simplicity. 
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We then take the inner product of this equation with each of the N bra vectors I, 
leading to 


N 


E = e "’E < il3I > 


i=i i=i i=i 

where the last step follows from the assumption that the degenerate states are 
orthonormal, that is, they satisfy 


/ ( 0 ), ( 0 ) } , 

' Y n,j' Y n, i 1 J> 

On the left-hand side of (11.31) we see the matrix elements 


(<rZ\HyS)=wji 


(11.32) 


(11.33) 


of the perturbing Hamiltonian in the subspace of degenerate states. In fact, in this 
subspace, (11.31) is just the standard eigenvalue equation. For example, in the case 
that N — 2, (11.31) can be written in matrix form as 



The lirst-order energy shifts will be the eigenvalues of this equation and the corre¬ 
sponding eigenstates will be the proper linear combinations of the degenerate states. 
Of course, if, by chance, we had initially chosen the proper linear combination of 
states, we would have found that the matrix representation is diagonal, with the first- 
order shifts in the energies as the diagonal matrix elements. Thus we can say that in 
determining these first-order shifts we are diagonalizing Thepenurbing Hamiltonian 
in the subspace of degenerate states. 5 


11.3 The Stark Effect in Hydrogen 


As an interesting illustration of degenerate perturbation theory, let's consider what 
happens when we apply an external electric field E to the hydrogen atom, producing 
the Stark effect. We expect a perturbing Hamiltonian of the form 

H\ = — fi e • E = er ■ E (11.35) 

where the electric dipole moment p e of the hydrogen atom is —er, since the radius 
vector r points from the proton to the electron, while the dipole moment points from 


5 It .should be emphasized that we are not diagonalizing the perturbing Hamiltonian in the 
space formed by the (often infinite) complete set of eigenstates of H 0 . If we were able to carry out 
this diagonalization, we would be able to find the exact eigenstates of Hq + and we would not 
need to resort to perturbation theory. 
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the negative to the positive charge. Of course the unperturbed Hamiltonian is ju 



(II. 


with eigenstates |n, /, m). 

We choose to orient our coordinate axes so that the electric field points in th 
direction. The electric dipole Hamiltonian becomes 


«i = e|E|f 


(II.. 


We first consider the ground state, for which we can utilize nondegenerate pertur 
tion theory to calculate the first-order shift in the energy: 

£{ 1) = e|E|(l, 0, 0|z|l, 0, 0) = 0 (II.. 


The expectation value vanishes since eigenstates of the hydrogen atom with defir 
orbital angular momentum / have definite parity (—1) / (see Problem 9.15). Thus, 
for the first-order correction to the harmonic oscillator in an external electric fi 
in Example 11.1, the expectation value (I 1.38) in position space involves an c 
function, which integrates to zero. 

The second-order shift in the ground-state energy is given by 

r-( 2 ) v- e 2 \E\ 2 \{n,l,m\z\],0.0)\ 2 

£ i = 2- - P ( 0 ) _ -to,- (11 - 

n;&l./.m C I c 'i 


Notice that the sum is over all states except the ground state. Although this suit 
not as easy to evaluate as the one for the harmonic oscillator, the physics in the t 
cases is essentially the same. 6 Here again, the atom in the ground slate does not ht 
a dipole moment, as indicated by (11.38), but one is induced by the applied elect 
field, generating a shift in the energy proportional to E : . 

Let’s now mm our attention to the first excited states of hydrogen, where l 
principal quantum number n is two and there is a four-fold degeneracy, ignoring sp 
We first construct the 4x4 matrix representation of H\ using the lour degener; 
states |2, 0, 0), |2. 1, 0), 12. 1, 1), and |2, 1. —1) as a basis: 


l (2. 0,0| Wi|2, 0. 0) 
(2, 1. 0|W,|2, 0, 0) 
(2, I. l|ff,|2. 0, 0) 
V ( 2 , 1 . - 11 / 7 , 12 , 0 . 0 ) 


<2, 0, 0|H,12, 1,0) 
<2, 1, 0|H,|2, 1,0) 
(2. 1, l|//,|2. 1.0) 
(2, 1. —11/7,12, 1,0) 


(2, 0, 0|ff,|2, 1. 1) 
(2, 1.0|tf,|2, 1. 1) 
(2, 1. l|tf,|2, I, 1) 
(2. l.-I|W,|2. 1, 1) 


(2, 0. 0|A/||2, I, -1) 
(2, I, 0|tf,|2, 1,-1) 
(2. 1, I|//ii2, 1.-1) 
(2, 1. -l|tf,|2, 1,-1 
(11.- 


6 For an exact calculation of the second-order Stark effect, see S. Borowitz, Fundamental: 
Quantum Mechanics , W. A. Benjamin, New York. 1967. 
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We have chosen a particular order for the states in this matrix for reasons that will 
become apparent shortly. 

Evaluating 16 matrix elements and then diagonalizing a 4 x 4 matrix is straight¬ 
forward, but it does not seem like a particularly enjoyable task. However, as is 
frequently the case in applications of degenerate perturbation theory, there arc sym¬ 
metry arguments that allow us to deduce without explicit computation that many of 
these matrix elements vanish. For example, as in (11.38), we can use the parity argu¬ 
ment to deduce that all the diagonal matrix elements must vanish. In fact, since the 
evenness or oddness of the wave functions depends on the value of / alone and not 
the value of m, we see that all the matrix elements where the ket and the bra have the 
same / vanish. Thus the only nonzero matrix elements can be the off-diagonal matrix 
elements in the first row and first column of the matrix. Moreover, with the electric 
field pointing in the z direction, the perturbing Hamiltonian is invariant under rota¬ 
tions about the z axis, and thus the Hamiltonian commutes with the corresponding 
generator of rotations, L z . Explicitly, the perturbing Hamiltonian just involves the 
position operator z, and from (9.72c) we see that 

[H,,LJ = 0 (11.41) 


Consequently, 


m'h(n, /', m'\z\n, I, m) = (n, /'. m'\L z z\n, l. m) 

= (n, /', m'\zL z \n, I. m) 

= mti{n , l\ tn'\z\n. I, m) (11.42) 


and therefore 


(n, I', m'\z\n, l, m) = 0 m^m' (11.43) 

The vanishing of the commutator (11.41) dictates that matrix elements of the per¬ 
turbing Hamiltonian with different m’s vanish. 

Thus the only matrix element in (11.40) that we need to evaluate explicitly is 

(2.0.01/7,12, l,0) = e|E|(2,0. 0|z|2, 1.0) (11.44) 

Using the position-space radial wave functions (10.44), the spherical harmon¬ 
ics (9.151) and (9.152b), andz = /• cos#, we find 

/*og /'Tt rhr 

(2. 0. 0|//,|2, 1, 0) = e|E| / r 2 dr I sin 6> </# / d<\> R* 0 Y* 0 r cos#/?, ,K, 0 

Jo Jo Jo 

= —3e|E|a 0 (11.45) 
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a here the length a 0 is just the Bohr radius of hydrogen. Therefore, the 4 x 4 
matrix (11.40) is given by 


/ 0 —3e|E|a 0 0 0\ 

—3e|E|n 0 0 0 0 

0 0 0 0 

^ 0 0 0 Oy 


(11.46) 


where we have taken advantage of the Hermiticity of the Hamiltonian to relate the 
value of the matrix clement in the second row, first column to that in the first row, 
second column. 

Thus for the Stark effect in hydrogen, (11.31) can now be written as 


/ 0 —3<?|E|a 0 0 0\ 




fcA 

—3cjE|a 0 0 0 0 


c 2 

= e { 2 u 

c 2 

0 0 0 0 


q 


q 

^ 0 0 0 0) 


V C 4V 


\q/ 


Recall that for this equation to possess a nontrivial solution, the following determi¬ 
nant must vanish: 


-E™ —3e|E|a 0 0 0 

—3e|E|a 0 —E\ ]) 0 0 

0 o ~ e ( 2 ]) 0 

0 0 0 —e 2 ] 


(11.48) 


The four values for the first-order shifts in the energy are 


E ( 2 ]) = 0, 0, 3e|E|a 0 , -3e|E|a 0 (11.49) 


If we substitute these values into (11.47), we find that the corresponding linear 
combinations of the degenerate eigenstates are given by 


12 , 1 , 1 ), 12 , 1 ,- 1 ), ±(\ 2 , 0 . 0 ) - | 2 , 1 . 0 », 

v2 s/2 


(| 2 , 0 , 0 ) + | 2 , 1 , 0 » 


(11.50) 


respectively, as indicated in Fig. 11.5. Again, as a consequence of (11.41), we sec 
that the two states with the same m values are the only ones mixed together by the 
perturbation. Therefore, we could have chosen initially to concentrate our efforts 
in degenerate perturbation theory on these two states alone and formed at most the 
2x2 matrix in the upper left-hand corner of (11.48). Finally, notice that when there 
is degeneracy, there is an energy shift linear in the applied field, as compared with 
the quadratic effect for the ground state. Although each of the states |2, 0, 0) and 
|2, 1, 0) has a definite parity, the linear combinations of these states in (11.50) do 
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/-1- ^12,1,0>-^12,0,0) 

3*dE|ao 

/ i 

- < -1- |2,1,1>,|2,1,-1) 

I 

3<?|EI«o 

\ I 

''- 1 - ^= 12 , 1 , 0 )+^ 12 , 0 , 0 ) 

Figure 11.5 The first-order shifts in the energy levels of the 
n = 2 states of hydrogen in an external electric field. 


not. Consequently, these linear combinations can have a nonvanishing expectation 
value of the electric dipole moment, which can then interact directly with the applied 
electric field. 

11.4 The Ammonia Molecule in an External Electric 
Field Revisited 


With these results in mind, let’s return to the example of the NH 3 molecule in an ex¬ 
ternal electric field with which we started our discussion of perturbation theory. First, 
using perturbation theory, we consider the case of a weak field. The eigenstates of 


/<l|/7 0 |l} <l|ff 0 |2)\ I E 0 -A\ 
V<2|// 0 |1) (2\H 0 \2) )-\-A E 0 ) 


(11.51) 


are the states 1 1) and | II) given after equation (11.9). If we use these states as a basis, 
the matrix representation of H 0 is diagonal, 


/ </|ff 0 |/> (i\H 0 \n) 

\(II\H 0 \I) <//|4|//> 


E 0 -A 0 \ 

0 £q -F A ) 


(11.52) 


as we saw in Section 4.5, while the matrix representation of the perturbing Hamil¬ 
tonian is given by 


I (I\H,\I) </|£,|//)\ / 0 M,|E|\ 

V(//|H 1 |/> (IIIHJII)) ~ U C |E| 0 ) 


Since the parity operator fl inverts states through the origin, the effect of applying 
the parity operator, indicated in Fig. 11.6, is to take the state i 1) of the molecule, in 
which the N atom is above the plane formed by the H atoms, and change it into |2), 
in which the N atom is below the plane: 


Similarly, 


nil) = 12} 


n|2) = |i) 


(11.54a) 

(11.54b) 
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Figure 116 The action of the parity operator on state 11) of the NH ? 
molecule, with the N atom above the plane formed by the three H 
atoms, as shown in (a), produces state |2>, with the N atom below the 
plane, as shown in (b). 

Thus both the ground state, |/>, and the first excited state, |//), are eigenstates of 
parity: 

fti/ > : - ■ ft (>> + >>) - (>> + >>)= 1,1 <’ 
ni//> = ft (-1|» - d=P>) = O - >>) = -m " 1.55b) 

Therefore, as shown by the vanishing of the diagonal matrix elements of the perturb¬ 
ing Hamiltonian (11.53), the first-order shift in the energy due to an external electric 
field is zero, since the electric dipole moment operator has a vanishing expectation 
value in a state of definite parity. Our first-order results tire in agreement with the ex¬ 
act result (I 1.9), showing that the molecule exhibits an energy shift that is quadratic 
rather than linear in the applied lield. 

What happens if the electric field is a strong field satisfying /r e |E| » A"! If we 
were still permitted to use nondegenerate perturbation theory with (11.53) as the 
perturbing Hamiltonian, the first-order shifts in the energies would vanish. However, 
from the exact eigenvalues (11.8) we see that 

E = E 0 ±n e \ E|±-^—ep.-. Mf |E|»A (11.56) 

which has a leading term that is linear in the field. The reason for this discrepancy is 
that for /x e |E| »Awe really need to use degenerate perturbation theory. Although 
the states |7) and |//) are not strictly degenerate, they are close together in energy. 
The energy difference between them is 2 A, which is much less than the energy /x c |E| 
for strong fields. Thus the magnitude of the factor (11.27) is 
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(umi) 

rCO) _ r(0) 

£L U 


He !E| 
2A 


» 1 


(11.57) 


and we cannot expect nondegenerate perturbation theory to work. 

Let’s see how we combine perturbation theory 7 with matrix mechanics to work 
out the terms of the series (11.56). In the strong-field limit we can include the dipole 
moment interaction as part of H 0 and break up the Hamiltonian matrix (11.7) in the 
11)-|2) basis as follows: 


<>W>W*. + ,,|E| 0 \ (11.58a) 

\<2|H„|1) a\H„\2)> V 0 £„ - n,iEi ) 

/(UAH) <W>W<> -A y 

V <2| //, 11> <2|fl,|2>/ \-A 0 t 


Clearly, the eigenstates of this H a are just the states |1) and |2) with eigenvalues 
£ j 0) = £ 0 + /x,|E| and £' 0) = £ 0 — /x e |E|, respectively. The first-order shift in these 
energies in the strong-field limit vanishes: 


E« = (2|H i |2) = (0,1)(_° a “ A )(“)=0 


(11.59a) 

(11.59b) 


while the second-order shift is given by 




F &) i<2|j/,idi 

r-(0) 


£} 0) - £f Eo + M,|E| - (£ 0 - Helm 2fx e \E\ 


-c :) 0 |2 


77 ( 2 ) |< 1 |//,| 2 >| 

^2 — n 


Ef - £| 0) Eo ~ He\E\ - (£ 0 + |E|) 2/r,|E| 


(11.60a) 


(11.60b) 


These results agree with the expansion (11.56). 

In the next section we will examine perturbations to the hydrogen atom due to 
internal relativistic effects. These perturbations partially break the degeneracy of the 
four n — 2 states, for which we used degenerate perturbation theory in the previous 
section to work out the Stark effect. But the message of this section is that these 
relativistic effects don't obviate the need for degenerate perturbation theory as long 
as the magnitude of the matrix element (11.44) of the perturbing Hamiltonian is large 
compared with the energy scale of these relativistic effects. In general, whenever the 
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unperturbed states are “close” together in energy, we should include them in the 
mbspace of states that we use to form the matrix representation of the perturbing 
Hamiltonian. 

11.5 Relativistic Perturbations to the Hydrogen Atom 


Although the agreement between the observed spectrum of hydrogen and our theo¬ 
retical predictions of Section 10.2 is excellent, there is a fine structure to these energy 
levels that we haven’t accounted for at all. Overall, there are three relativistic pertur¬ 
bations to the Hamiltonian (11.36) of the hydrogen atom that contribute to the fine 
structure: a relativistic correction to the electron’s kinetic energy, a spin-orbit inter¬ 
action, and the Darwin term. The spin-orbit interaction couples together the intrinsic 
spin and orbital angular momentum of the electron. 


THE RELATIVISTIC CORRECTION TO THE KINETIC ENERGY 

One obvious relativistic perturbation is that the kinetic energy in (11.36) arises from 
a completely nonrelativistic approximation. Instead of expressing the kinetic energy 
operator of this two-body system as 


n 2 p 2 

2 m e 2m p 


(11.61) 


we use the relativistically correct expression for the electron’s kinetic energy, in 
which case 


K = y p 2 c 2 + ( m e c 2 )~ — m e c 2 + 


-2 
2m , 


= m eC 2 (-y/l+ (p 2 /m 2 c 2 ) - 1 ) + 


(11.62) 


Expanding the square root in a Taylor series, we find 


K - 


_p;_ 

2 m„ 8 mlc 2 




2m , 


(11.63) 


In the center-of-mass frame (see Section 9.3), the kinetic energy operator can then 
be written as 


- p 2 (p 2 ) 2 

2 n 8 m?c* 


(11.64) 


In deriving (11.64), we have ignored the relativistic correction to the proton’s kinetic 
energy because m p 3> m e . 
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The unperturbed Hamiltonian for a hydrvgenic atom is the usual 


Hn — ~ 


(11.65) 


In an energy eigenstate 


(n. /, l, in) = -Ef® = 

2 fi 2 n- 


(1 1 . 66 ) 


(see Problem 11.17). Because of the small value of a. for modest values of Z the 
average kinetic energy is much less than the rest-mass energy, and therefore the atom 
is quite nonrelativistic. We thus can treat 


H,= 


(p 2 ) 2 

8m ?c 2 


(11.67) 


as a perturbation on the Hamiltonian (11.65). Notice that (11.67) is rotationally 
invariant and therefore 


[H k . LI = 0 


( 11 . 68 ) 


Thus, although the eigenstates |n, /, m) of H 0 are highly degenerate, the matrix 
representation of the perturbing Hamiltonian (11.67) in each degenerate subspace is 
already diagonal, and we can calculate the lirst-order energy shift as 




(11.69) 


We could evaluate (11.69) directly in position space, letting the operator (p 2 ) 2 -* 
(-/) 2 V 2 ) 2 differentiate the wave function (r|;/, /, m) = R n t (r )Y I m (8, <p). and so on. 
Fortunately, there is a better way. We can simplify the evaluation by rew riting the 
operator (11.67) in the form 

= — 1 ( iLV = __L U + z*L\U+*L\ duo) 

8m 2 c 2 2 m e c 2 \2 m e ) 2m e c 2 y l r l / \ r| / 

where we have ignored the difference between the reduced mass of the hydrogen 
atom and the mass of the electron in the perturbation. Thus 


£ ">__ 
c n./ — 


— (E^) 2 + 2 £< 0) (n, /, m\^-\n, /, m) + («, /. m\- 
2 rn e c 2 |r| i 


—I n, 1. m) 

r l' 


(11.71) 


From Problem 11.16 


~{n, /, m \—— |n, /, m) =2E (0) 

r 


(11.72) 
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and from Problem 11.18 


. . ,Z 2 e 4 , . v ZV 4 (£«») 2 /i 

{/ j , /, m\-^Y\n, I. m) = --- '■ - 


rr %n 3 0 + 4) 1 +1 


(11.73) 


Thus the first-order shift in the energy due to the relativistic correction to the 
electron’s kinetic energy is given by 


r(D 1 2 7 4 4 

E k — — ~m f cZ a 


-T + 


4n 4 n 3 (/ + 4) 


(11-74) 


SPIN-ORBIT COUPLING 

In order to determine the form of the spin-orbit interaction, we start with a classical 
argument. In the rest frame of the electron, the motion of the proton generates a 
current, which, from the Biot-Savart law, produces a magnetic field 


B = 


—Zex x r 


c r 


3 


(11.75) 


where —v is the proton’s velocity, which is equal and opposite to the velocity v of 
the electron in the proton’s rest frame. The energy of interaction of the electron’s 
intrinsic spin magnetic moment with this magnetic field is given by 


-/i B 




X e ; 

2m,c 


—Zex x r 


c r-’ 


)- 


Ze 2 
m?c 2 r 3 


S L 


(11.76) 


where L = r x p is the electron’s orbital angular momentum. We have also taken 
g = 2 for the electron. 7 

Equation (11.76) might not seem like a truly relativistic effect. However, we can 
express the magnetic field (11.75) as 


B = -(x/c) x E 


(11.77) 


where E is the electric field in the electron’s rest frame. Magnetic effects, which 
depend on the motion of charges, arc all inherently relativistic, as the factor of x/c 
in (11.77) suggests. In fact, in “deriving” (11.76), we have made a relativistic error. 


It is interesting to see how the factorof 1/c 2 in (11.76) arises in SI units. Since n = — (e/m e )S 
and B = (/t 0 /4nr)[(—Zev) x r/r 3 ] in SI units, 

„ Ze 2 (n 0 e 0 ) S • L Ze 2 S L 

— fl ■ D = - — - — ~ --- 

47r£ 0 mjV 3 4 jT£ 0 rn^c-r 3 

where we have used /to£o = 1/c 2 in the last step. Titus, as is the ease for expressions such as 
the potential energy —Ze 2 /r in Gaussian units, we can go from (11.76) to the corresponding 
expression in SI units with the replacement e- -*■ e 2 /4.Te 0 . 
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which was first discovered by L. Thomas and is called the Thomas precession effect. 
This effect simply reduces the energy of interaction (11.76) by a factor of two. We 
will not derive the Thomas factor here. 8 The best way to obtain the full spin-orbit 
interaction Hamiltonian 


A-o=—(11.78) 
2m-c-\r\ 

is from the nonrelativistic limit of the famous Dirac equation with a Coulomb 
potential energy. The Dirac equation is the fully relativistic wave equation of a 
spin-j particle, such as the electron. This equation, for example, predicts that g — 2 
for the electron, so relativistically we don’t have to insert this factor by hand based 
on experimental results, as we have done so far. 

We are now ready to treat the Hamiltonian (11.78) as a perturbation. Let’s 
concentrate first on the L ■ S part of the interaction, which is reminiscent of the 
spin-spin interaction Sf • S 2 (see Chapter 5) that couples together the spin angular 
momentum states of two spin-f particles. Here the story is essentially the same, 
except that one of the angular momentum operators is orbital and the other is intrinsic 
spin. We can form a basis as a direct product of the orbital angular momentum and 
intrinsic spin states: 

|/, m, +z) = |/, m) ® |+z) = |/, m) <g) ||, 5 ) (11,79a) 

|/, m, -z) = 1 1, m) ® |— z) = \L m) ® ||, - j) (11.79b) 

We can form simultaneous eigenstates of L 2 and L, as well as S 2 and 5., since the 
orbital and spin angular momentum operators commute with each other. After all, 
L generates rotations in position space, while S generates rotations independently 
on spin states. Thus the operator that generates rotations of both the spatial and spin 
degrees of freedom is the total angular momentum operator 

J = L + S (11.80) 

Diagonalizing the interaction Hamiltonian (11.78) means finding the eigenstates 
of L - S. Just as the eigenstates of • S 2 arc eigenstates of total spin, the eigenstates 


8 See, for example, R. Eisberg and R. Resnick, Quantum Physics of Atoms, Molecules, Solids. 
Nuclei, and Particles, 2nd cd., Wiley, New York, 1985, Appendix O. Thomas's discovery provided 
the mysterious factor of two necessary' to make Goudsmit and Uhlenbeck’s intrinsic spin hypothesis 
fit the spectrum of hydrogen, Uhlenbeck has noted that “it seemed unbelievable that a relativistic 
effect could give a factor of two instead of something of order v/c” and “even the cognoscenti of 
the relativity theory (Einstein included!) were quite surprised." Physics Today , June 1976, p. 48. 
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f I. • S are eigenstates of total angular momentum j 2 and j z . where 

J 2 = L 2 + S 2 + 2 L • S (11.81a) 

J Z = L Z + S Z (11.81b) 


From the expression 


A A A* A A A a 

2 L ■ S = J 2 — L 2 — S 2 (11.82) 

it is easy to see that J z commutes with L • S. since commutes with J 2 , L z commutes 
with L 2 , and S z commutes with S 2 . Or we can evaluate the commutator explicitly, 
which shows that although neither L. nor S, commutes with L • S, the operator J, 
does : 9 

[J z , 1 • S] = [L z + S z , L • SJ = [L z + S z , L X S X + LySy + L Z S Z ] 

= [L. L X ]S X + [L z , L y ]Sy + [S z , S X \L X + [S z , Sy]Ly 

= ifiL v S x — ifiLySy + ifiS v L x — ifiS x L y = 0 (11.83) 

Since these operators commute, we can find eigenstates of L • S that are simultaneous 
eigenstates of J,. 

For the hydrogen atom, this substantially simplifies the job of determining the 
linear combinations of degenerate states that diagonalize the perturbing Hamiltonian. 
Counting the intrinsic spin states of the electron, there are In 2 degenerate states for 
any given n. However, (11.83) shows that only slates with the same eigenvalue for 
J. can be mixed together by the perturbation. Also, since // s _ 0 commutes with L 2 . 
we can focus on states with the same value for I. For a fixed /. there are just two 
stales with the eigenvalue of J z equal to (m + \)li: 

\l, m, +z) |/, m + 1, — z) (11.84) 

assuming that m ^ l ; otherwise, there is only a single state. In order to determine the 
two linear combinations of these states that are eigenstates of 2L • S, we use these 
two states as a basis to form the matrix representation of the operator. Using the 
identity 


2 L • S = L + 5_ + L_S + + 2L z 5 2 (11.85) 


9 One can also argue that L S as a dot product of two vector operators is invariant under total 
rotations and that J. is a generator of total rotations about die z axis and must therefore commute 
with L • S. 
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and the general results for the action of the angular momentum raising and lowering 
operators, we find that the matrix representation is given by 


2 L • S -> /r 


//(/ + 1) — mini -f 1) 


//(/ 4- 1) — m(in + 1) 
~(m + 1) 


( 11 . 86 ) 


where we have ordered the two basis states in (11.84) as |1) = |/, m, +z) and 
|2) = |/, m + 1, —z) in constructing this matrix representation. 

The eigenvalue equation 


2 L ■ S|a) = Xh 2 \X) 


has nontrivial solutions provided that 


m — X v T(l + 1) — m(m + 1) 

/l(l + 1) — m(m 4- 1) — (m + 1) — A 


(11.87) 


( 11 . 88 ) 


X 2 + X — /(/ 4- 1) = 0 


(I 1.89) 


The two solutions are X — / and X = —(/ + 1). By substituting these eigenval¬ 
ues into (11.87) in matrix form, we can determine the linear combinations of the 
states (11.84) that are eigenstates of L ■ S. Since each of the states (11.84) is an 
eigenstate of L 2 with eigenvalue /(/ + l)fi 2 and S 2 with eigenvalue ^(1 + 1 )H 2 , 
these linear combinations are also eigenstates of J 2 , as given in (11.81 a). The value 
of the total angular momentum quantum number j is then determined by 


JU + !)=/(/ +l) + 1(^ + 1) + 


-(/ + D 


(11.90) 


which yields the two solutions 


(11.91) 


Thus the L • S interaction term has coupled the orbital angular momentum l together 
with the spin angular momentum i to produce a total angular momentum j that takes 
on the values / + 4 and l - \ - The eigenstates are given by 

I;■ = / + nij) = |/, m, +z) + 1 1, m + 1, -z) (11.92a) 

\j = / - 171 j) = \h m, +z) - ^ l+ 2i m + \ l I/, * + 1- -*) 0 1 -92b) 
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w ith m , = m + The right-hand side of these equations can be expressed directly 
m terms of m, as 


\j = l±{, ntj) 


j 1 ± ntj + 
21 + 1 


\l, rtij — j, +z) 


\l^m r . 

± V 21 + 1 + 5* -z ) 


(11.93) 


We now know the linear combinations of the basis states (11.84) that diagonalize 
the perturbing Hamiltonian (11.78) in the subspace of degenerate states. The energy 
shift due to the spin-orbit interaction is given by the expectation value of this 
Hamiltonian for these states: 


ly ^7. 

E s-o = J' , „., 3 l ■ S|«. y> 

2m 2 c 2 |r| 

- Zg2fi2 / 1 \ ( 7 J = 1 + 3 

4 mjc 2 \r 3 l n j { -(/ + 1 ) j =1 ~ { 

Since from Problem 11.19 

l—\ - Z 3 

\r*) nJ ~ aln*l(l+-{)(! + 1) 


(11.94) 


(11.95) 


we can express the first-order shift in the energy due to the spin-orbit interaction as 



m e c?Z 4 a 4 
4n 3 /(/ + $)(/ + 1) 


/ y-/+i 

-(/ +1) y=/-4 


(11.96) 


The spectroscopic notation that is used to label these states. Is, 2s, 2p, and so on, 
where the number in front is the principal quantum number and the letter indicates 
the orbital angular momentum (/ = 0 is s, l = 1 is p, 1 = 2 is d, .. .), now' needs to 
be enlarged to specify the total angular momentum as well. This is done by adding 
a subscript indicating the value of j. For / = 0, the value of j must be A; for / = 1, 
its value is \ or 4, while for / = 2, the value of j is 4 or 4. and so on. Thus the states 
of the atom, including the total angular momentum, are lS(/ 2 , 2 .? l/2 , 2p Xjl , 2 p 3 / r>, 
and soon. Equation (11.96) shows, for example, that the 2p X n and 2/7 3/2 states have 
different energies when the spin-orbit interaction is included. 

As we will discuss in the next chapter, this labeling can also be extended to 
mulliclectron atoms. Multielcclron atoms that are quite similar to the single-electron 
hydrogen atom include the alkali elements, such as sodium. For sodium in the ground 
state, 10 electrons fill up the l.v, 2.v, and 2 p energy states, while the 11th electron 
is in the 3.s level. Although the 3 s electron tends to reside in a shell that is outside 
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3/ty2 

•Vi/2 


A = 5890 A 


A = 5896 A 


3*1/2 


Figure 11.7 The spin-orbit splitting of the 3 p l/2 and 
-V 3/2 levels leads to a line structure that is responsible 
for the sodium D lines. The energy difference between 
the 3* and 3 p levels results from the fact that the 
potential energy experienced by the n = 3 electron is 
not a pure — e 2 /r Coulomb potential. 


the other 10 electrons, the radial wave functions shown in Fig. 10.5 reveal that its 
wave function penetrates inside the electron cloud formed by the inner electrons. It is 
thus only partially shielded from the nucleus with its Z = M positive charge, and its 
energy is reduced from the n = 3 value (10.34) for hydrogen. Unlike the / = 0 states, 
the wave function of the 3 p electron vanishes at the origin. It thus doesn't "see” the 
nucleus as much as does the 3 s electron, and consequently its energy is not reduced 
as much. Thus the degeneracy between the 3* and 3 p energy states that is present in 
hydrogen is broken in the sodium atom. The spin-orbit interaction then adds a fine 
structure to the sodium energy levels. In particular, the difference in energy between 
the 3pi/ 2 and 3 py 2 states is responsible for the two closely spaced yellow lines in the 
spectrum, known as the sodium D lines, which are produced when the atom makes 
a transition from the 3p to the 3 s state (see Fig. 11.7). 

THE DARWIN TERM 

If we evaluate (11.96) for / = 0, we obtain a fini te result. Of course, in a state with 
zero orbital angular momentum, there cannot be any spin-orbit interaction. The finite 
result arises because the expectation value (11.95) of 1 fr' in the hydrogenic wave 
functions has a 1// dependence that cancels the factor of / from the eigenvalue of 
2 L • S for a state with j — l + \. In fact, if we were to evaluate the expectation 
value (I 1.95) using more exact relativistic wave functions from the Dirac equation, 
we would find that there is actually no spin-orbit contribution for / = 0. as you would 
expect physically. How'ever. a perturbative solution to the Dirac equation shows that 
there does exist an additional interaction that we have not included in our discussion 
of relativistic perturbations. 

The Dirac equation is a four-component wave equation, as compared with the 
two-component spinors that we introduced in Chapter 2 to represent the spin states 
of spin-f particles. When one reduces the equation to an effective Schrodinger- 
like equation by eliminating the lower two components, one finds in addition to 
the perturbations (11.67) and (1 1.78) an additional perturbation of the form 

H 0 = ^3 [P‘> TP, V ( |f |} ]] (11.97) 
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where the momentum operators are dotted with each other. 10 Thus H D is rotationally 
invariant like H K , and we can calculate the first-order energy shift by 

£p" = (n, /, [p, V (|r|)]]|/i, /, tn) 

= /A|*„.,| 2 |r,j^7 2 v 

= / d 'r \Rn.,\ 2 \y,. m \ 2 ^0^-S\r) (11.98) 

Since only / = 0 states are nonzero at the origin, this Darw'in term contributes only 
for s stales. In fact, the magnitude of this contribution 

a 7>7 ('" / (y 

(n, 0, 0|// D |/i, 0, 0) = (11.99) 

turns out to be exactly the same as the spurious / = 0 contribution from (11.96) for 
the spin-orbit interaction. 

Why does the Dirac equation have four components instead of two? Any quantum 
mechanical relativistic description of particles must include the antiparticles as well 
as the particles—in this case positrons as well as electrons. Each of these particles 
is a spin-4 particle, and thus we end up with a four-component equation. Why must 
the positrons be included in our treatment? One way to see this is to go back to the 
energy-lime uncertainty relation (4.63) and note that for time intervals 

Af ~ —-r (11.100) 

m e c 2 

the uncertainty in the energy A £ ~ 2 m e c 2 . which is sufficient to create an electron- 
positron pair. Thus, in addition to an amplitude for the hydrogen atom to be an 
electron and a proton, there is an amplitude for the atom to be an electron, a proton, 
and an electron-positron pair. In fact, you can see that for sufficiently short time 
intervals, the atom can be teeming with activity with many pairs of electrons and 
positrons, and even particle-antiparticle pairs of heavier particles as well. It is this 
sort of behavior that makes quantum field theory a complicated many-particle theory. 


10 For example, see R. Shankar. Principles of Quantum Mechanics, 2 nd edition. Plenum, New 
York, 1994. pp. 569-574. Other references for learning about the Dirac equation include J. D. 
Bjorken and S. D. Drell, Relativistic Quantum Mechanics, McGraw-Hill, New York. 1964, and 
J. J. Sakurai, Advanced Quantum Mechanics. Addison-Wesley. Reading, MA, 1967. This latter 
book is highly recommended for its excellent discussion of the physics associated with the Dirac 
equation, although it does use a somewhat old-fashioned ict metric. 


Page 423 (metric system) 




11.5 Relativistic Perturbations to the Hydrogen Atom 


407 


V 



Figure 11.8 Fluctuations on the distance scale h/m e c 
produce a significant change in the potential energy 
near the origin. 


In an atom containing electron-positron pairs in addition to the usual electron and 
proton, the concept of a simple potential energy of interaction between the electron 
and the proton must break down. This breakdown occurs on the distance scale 

f- 

Ar^c&t ~—— (11.101) 

m e c 

roughly the Compton wavelength of the electron, which becomes effectively the 
electron’s charge radius. Note that (11.101) is a factor of or smaller than the Bohr 
radius a 0 of the atom. It is interesting to note that if we replace the potential energy 
with a smeared average over this distance scale, we obtain 

V = V(D + A? ■ VV + ^ ~^ Xr jV~T- + • •' 

2 77 dx ‘ dx J 

= V(r) + -Ar 2 V 2 V-h-- = V(r) + - (— ) V 2 V + --- (11.102) 

6 6 \m e cj 

where we have assumed that the vector displacements average to zero and that there 
is spherical symmetry. Equation (1 1.102) yields the same form for the perturbing 
Hamiltonian that appeals in the middle of (11.98), except the factor of £ is replaced 
by i. As Fig. 11.8 shows, fluctuations on the distance scaled 1.101) for the Coulomb 
potential have a substantial effect only near the origin, and that is why only s states 
are affected. 
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11.6 The Energy Levels of Hydrogen 


Adding the energy shifts (11.74), (11.96), and (11.99) together, we obtain 


£ (l) 4 - £ ll) + E 
T ^s-o + C D 


(i) 



m e c 2 (Za) A 
2 u 3 




(11.103) 


Notice that the magnitude of the total energy shift is of order (Za) 2 times the unper- 
turbed energy (10.34) of the atom. In particular, for hydrogen (Z = 1), the energy 
shift is roughly 10~ 5 as large as the unperturbed energy. Thus the perturbations do 
indeed contribute a fine structure to the energy levels—hence the name for the 
fine-structure constant a. Figure 11.9 shows the energy-level diagram for hydrogen, 
including this fine structure. Also note that although each of the individual energy 
shifts (11.74), (11.96), and (11.99) depends on the value of/, the total shift does 
not. This surprising degeneracy is actually maintained to all orders in the relativistic 
perturbation when the Dirac equation with a Coulomb potential is solved exactly. 11 

In 1947 W. E. Lamb and R. C. Retherford observed a very small energy differ¬ 
ence between the 2 j 1/2 and the 2 p ]j2 levels through the absorption of microwave 
radiation with a frequency of 1058 MHz, corresponding to an energy splitting 
of 4.4 x 10 -6 eV (see Fig. 11.10b). 12 This Lamb shift, which is of the order 
m e c 2 (Za) 4 ot log a, can be explained by quantum electrodynamics in terms of the in¬ 
teraction of the electron with the quantized electromagnetic field. 13 The Lamb shift 
has been measured to five significant figures, providing one of the most sensitive 
tests of quantum electrodynamics (QED). Note that the magnitude of the Lamb shift 
is roughly 10~ 6 of the spacing between levels that produce the Balmer series. Thus, 
measuring the shift itself with an accuracy of one part in 10 5 by detecting the differ¬ 
ence in wavelength of visible photons emitted as the atom makes transitions from 
higher energy states to the 2s l/2 or 2 p l/2 states would require a resolution of 1 part 


11 The exact energy eigenvalues for the Dirac equation with a Coulomb potential are given by 




~ 

( \ 2 " 
Zc* 

-1/2 

E n.J = m e cl 


1 + 

| 


ytt — (j + 1) + y (j + j) 2 — (Za) 2 J 



12 W. E. Lamb and R. C. Retherford. Phys. Rev. 72. 241 (1947); 86. 1014 (1952). This latter 
paper contains their most precise results. Lamb received the Nobel prize in 1953 for this work. 

13 We examine the quantized electromagnetic field in Chapter 14. However, we will not attempt 
to work out the value of the Lamb shift, which is itself a taxing problem. For an interesting 
discussion of the difficulties that this calculation presented to R. P. Feynman and H. Bcthe. two of 
the more clever physicists at performing calculations, sec Feynman's Nobel prize speech in Nobel 
Lectures — Physics, vol. Ill, Elsevier, New York, 1972. 
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n = 3 


-V-V2 

3 d 5n 
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-T 


n = 2 
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^3 d 3r . 
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Figure 11.9 An energy-level diagram for the n = 1, 
n = 2. and n = 3 levels of hydrogen, ineluding tine 
structure, which is exaggerated in scale by roughly 
a factor of 10 4 . States with different l's and the same 
j and n are degenerate. 


in 10 11 ! The main reason that we can isolate these QED corrections experimentally 
with such precision is the fortunate degeneracy, apart front these QED effects, of 
the 2.VJ/2 and 2states. In essence, the experiment of Lamb and Retherford is a 
sensitive null lest in that any absorption at the appropriate microwave frequency is an 
indication of an energy splitting that cannot be explained through purely relativistic 
effects. 

In our discussion of the perturbations for the hydrogen atom, we have so far 
neglected the effect of the proton’s spin. As we discussed in Chapter 5. the proton's 
intrinsic magnetic moment interacts with the electron's magnetic moment, leading 
to a hyperfine interaction. When we include the proton's spin degrees of freedom as 
well as those of the electron, the ground state has a four-fold degeneracy, which is 
split into two energy levels by the hyperfine interaction, as indicated in Fig. 11.10a. 
When the atom makes a transition between these two levels, it emits a photon with a 
frequency of 1420 MHz, or a wavelength of 21 cm. This energy splitting, which is on 
the order of (m e /m p )a 4 m e c 2 , is a factor of (m e /m p ) smaller than the fine structure— 
hence the name hyperfine structure. This factor of ( ni e /m p ) enters because the 
magnetic moment of the proton is smaller than that of the electron by ( m e /m p ). As 
Fig. 11.10b shows, this hyperfine structure occurs in excited states of the atom as 
well, producing splittings equal to 24 MHz for the 2 p 3/2 level. 178 MHz for the 2.V| /2 
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All 


1 * 1/2 


1420.4 MHz 


3AI2 


2 * 1/2 


177.6 MHz 


1057.8 MHz 


: 592 MHz 


(a) 


(b) 


Figure 11.10 The hypertine splitting of (a) the n = 1 and (b) the n = 2 energy- 
levels of hydrogen. The Lamb shift is the 1057.8 MHz splitting between the 
2.9^2 and 2/7 1/2 states of hydrogen, which is due to quantum elcctrodynamic 
effects. Without these QED effects, states of different / and the same j would 
be degenerate, as shown in Fig. 11.9. The hyperfine splitting of the 2py 2 level 
is not shown because the tine-structure splitting between the j = § and j — \ 
levels is roughly ten times larger than the Lamb shift. 


level, and 59 MHz for the 2/> 1/2 level. However, the form of the Hamiltonian is not 
as simple as (5.9) for states with orbital angular momentum l ^ 0. 14 

11.7 The Zeeman Effect in Hydrogen 


In Section 11.3 we examined the Stark effect, which is produced when a hydrogen 
atom is placed in an external electric field. In 1896 Zeeman observed the splitting 
of the spectral lines of the light emitted by an atom placed in an external magnetic 
held. To analyze this Zeeman effect in hydrogen, we take 

= = -(-^-L+—S J-B (11.104) 

\2m e c m e c J 

as the form for the interaction Hamiltonian, where the first term in parentheses is 
the magnetic moment operator due to orbital motion [see (1.2)], while the second 
term is that due to intrinsic spin for the electron. We have neglected the contribution 
of the proton to (11.104) because of the small magnitude of the proton's magnetic 
moment. Equation (11.104) includes the dominant part of the magnetic interaction 
for a one-electron atom if the applied held is not extremely strong. 1 - If we orient our 


14 For a derivation of the full hyperfine Hamiltonian, see S. Gasiorowicz, Quantum Physics, 
3 rd edition. Wiley, New York, 2003. For a calculation of these hyperfine splittings, see H. A. Bethe 
and F„ R. Salpeter, Quantum Mechanics of One- and Two-Electron Atoms, Springer-Verlag, Berlin, 
1957. Section 22. 

15 The full Hamiltonian in a magnetic field, ignoring intrinsic spin, is derived in Appendix E. 
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coordinate axes so that the magnetic field points in the z direction, the perturbing 
Hamiltonian becomes 

H H = -^-(L z + 2S r ) (11.105) 

2 m e c 

Let’s consider the effect of an external magnetic field on the n = 2 states of 
hydrogen. For magnetic fields on the order of 10 4 gauss, the magnitude etlB/m,,c 
of H b is comparable with that of spin-orbit energy (11.96) for n = 2. Thus for 
magnetic fields with a strength of a few thousand gauss or less, we can treat H B as a 
perturbation to the Hamiltonian of the hydrogen atom including spin-orbit coupling, 
in which case the states with j — l + \ and j = l — \ are not degenerate. Since H B 
may be written as 

-■t 

a . eB * ~ 

H b = - -(V 7 + S,) (11.106) 

2m e c 


which clearly commutes with J z as well as L 2 . states with different m, values are 
not mixed together by the perturbation. Therefore we can calculate the first-order 
shift in the energy as the expectation value of (11.106) in the states (11.93): 


r-O) _ 

E h — 


eB 
2 m.c 


{j = l±\, mj\(J z + S,)\j =1 ± mj) (11.107) 


Of course, the expectation value of J z in these states is just its eigenvalue but 
to evaluate (S z ), we must use the explicit form (11.93) for the states: 



// ±nij + | 

V 2 /-I- 1 


1 T mj + 4 \ 
2/ + 1 / 


m :h 


21 + 1 


(11.108) 


Hence 


r(l)_ 

t B — 


ehB 


-m , 


1± 


1 


2 m e c J \ 2/ + L 
Notice that we can express this energy shift compactly in the form 

,- 0 ) g(jJ)eHB 

Hi n — III l 

B 2 m e c J 


(11.109) 


( 11 . 110 ) 


which is reminiscent of the energy of a particle of spin j in an external magnetic 
field with a g factor 


g{j=l±\,l) = 



(11.111) 


known as the Lande g factor. 

Figure 11.11 shows the splitting ofthe 1 ji/ 2 < 2p l/2 , and states in an external 
magnetic field. Notice that the Lande g factor is 2 for the states, | lor the 
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(b) 

Figure 11.11 (a) The Zeeman effect for the L? and 
2p levels of hydrogen in a weak external magnetic 
field, showing the allowed electric dipole transitions, 
(b) A schematic diagram of the resulting spectrum. The 
dashed lines show the fine structure that is present in 
the absence of an external magnetic field. 


p ]/2 states, and § for the p 3/ o states. In Chapter 14 we will see how selection 
rules for electromagnetic transitions arise. The allowed electric-dipole transitions 
(A nij = 0. ±1) are indicated in the figure along with the corresponding spectrum. 
It is interesting to compare these results with what the spectrum would look like if 
the electron did not have intrinsic spin (see Problem 11.12). 


11.8 Summary 


To analyze a system using time-independent perturbation theory, we express the full 
Hamiltonian for the system in the form 


£ = H 0 + ^i (11.112) 

where the eigenstates and eigenvalues of the unperturbed Hamiltonian H 0 are 
given by 


*«\<P?) = E?W™) (1U13) 
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The perturbing Hamiltonian //, may arise from external perturbations such as those 
that come from applying electric fields—the Stark effect (Example 11.1 and Sections 
11.3 and 11.4)—or those that come from applying magnetic fields—the Zeeman 
effect (Section 11.7)—to the system, or from internal perturbations such as those 
causing the fine structure of the hydrogen atom (Section 11.5). If the slate |<p* 0) ) is 
not degenerate, the first-order and second-order corrections to the energy arc given by 


El" = 


(0)v 


and 




k£n 


p(0) _ r(0> 


•> 




(11.114) 


(11.115) 


When the unperturbed energy eigenstates are degenerate, formula (11.114) [as 
well as (11.115)] does not apply. Rather, the first-order corrections to the energy are 
the eigenvalues of the eigenvalue equation for the operator //, using the degenerate 
eigenstates of H 0 as a basis (sec Section 11.2). Often we can take advantage of a 
symmetry of the perturbing Hamiltonian to reduce the size of the degenerate 
subspaee in which we need to work. In particular, if [Hi, 4] = 0 (where A may be 
the generator of a symmetry operation lor H : ), only states that have both the same 
energy and the same eigenvalue a of the operator A are mixed together by the 
perturbation. See Sections 11.3 and 11.5 for illustrations. 


Problems 


11.1. Consider a perturbation H\ = bx 4 to the simple harmonic oscillator Hamilto¬ 
nian 


p! 1 

H 0 = — + -imo 4 x 
0 2m 2 


2-2 


This is an example of an anharmonie oscillator, one with a nonlinear restoring force, 
(a) Show that the first-order shift in the energy is given by 


£<» = 


3 h 2 b 




(l + 2/i + 2/r) 


(b) Argue that no matter how small b is, the perturbation expansion will break 
down for some sufficiently large n. What is the physical reason? 


11.2. For Example 11.1 use the series expansion for the exponential in the translation 
operator in 

!*„} = i(q\VAImar)\n)=e- i ^ ]f, ' lmui2u \n) 
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to evaluate the first-order correction to the state of the harmonic oscillator due to an 
applied electric field. Compare your results with the perturbative result. 


11.3. For the simple harmonic oscillator, for which 


H - ^ 
H °~2m 


-maP'x 2 


take the perturbing Hamiltonian to be 


H\ = -mccr.x 2 
2 1 

where co ] <<C co. Calculate the energy shifts through second order and compare with 
the exact eigenvalues. 


11.4. Calculate the first-order shift to the energy of the ground stale and first excited 
state of a particle of mass m in the one-dimensional infinite square well 


V(x) = 


0 0 < x < L 

oc elsewhere 


of (a) the constant perturbation H { — V x and (b) the linearly increasing perturbation 
H x — eE\ 0) x/L, where E[ 0) is the unperturbed energy of the ground state and e <£ 1. 


11.5. 

(a) Calculate the exact energy eigenstates of the Hamiltonian (11.7) of the am¬ 
monia molecule in an external electric field. 

(b) Assuming that /r e |E| <5C A, use perturbation theory to determine the first-order 
correction to the unperturbed eigenstates |7) and | // > and compare with the 
results of (a). 


11.6. The spin Hamiltonian for a spin-4 particle in an external magnetic field is 

// = -£ B = -^-S B 
2m c 

Take B — £ 0 k -f fi 2 j- with B 2 « S 0 . Determine the energy eigenvalues exactly and 
compare with the results of perturbation theory through second order in B 2 /B 0 . 


11.7. Assume that the proton is a uniformly charged sphere of radius R. 

(a) Show for the hydrogen atom that the potential energy of the electron in the 
field of the proton is given by 


V(r) = 


r < R 

2R 3 V 3 / 


e 

r 


r > R 


Hint: Use Gauss’s law and remember that the potential energy V ( r ) must be 
continuous. 
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(b) Calculate the energy shift for the Ls and 2 p states of hydrogen if the potential 
energy in (a) is used. What effect does this shift have upon the Lyman a 
wavelength? Suggestion: You can use the fact that R <£ a 0 to simplify the 
integrand before evaluating the integrals. 

11.8. Use the form of the Y t m (6, <p )'s to verify that (n, /'. m'\z\n, /, in) — 0 for 
m ^ in'. 


11.9. A particle of mass m is confined in the three-dimensional potential energy box 

0 0<x<L, 0 <y<L, 0 <z<L 

oo elsewhere 


V(x, y, z) — 


Determine the first-order shift in the energies of the ground state and the first excited 
states due to the perturbation 


H, = 


V\ 0<jc<L/2, 0<y<L/2, 0 <z<L 
0 elsewhere 

which raises the potential energy by an amount in one quarter of the box. 


11.10. The spin Hamiltonian of a spin-1 ion in a crystal is given by 

a ~ 7 h 

h? S; + ¥ ( 




Assume b <£ a and treat 




as a perturbation. Calculate the unperturbed energies and the first-order corrections 
using perturbation theory. Beware of the degeneracy. Compare your perturbative 
results with the exact eigenvalues. 


11.11. For the two-dimensional harmonic oscillator, the unperturbed Hamiltonian is 
given by 

- P 2 x 1 2-2 P 2 y , 1 2-2 

Hq = — + -nut) X -- + -7MGTV 

2m 2 2m 2 ' 

Determine the first-order energy shifts to the ground state and the degenerate first 
excited states due to the perturbation 

= bxy 


11 . 12 . 

(a) Determine how the energy levels of the hydrogen atom for the l.v and 2 p stales 
would appear in the absence of any intrinsic spin for the electron, with the only 
contribution to the fine structure coming from the relativistic correction to the 
kinetic energy. 
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(b) What happens to these energy levels if the atom is placed in an external 
magnetic field? 

(cl What is the resulting spectrum? 


11.13. Show for a general potential energy V (r) that the form of the spin-orbit 
Hamiltonian (11.78) becomes 


"so — 


1 


2m 2 c 2 \r\ dr 


"is 


Suggestion: Start with (11.77). 


11.14. Obtain the states (11.92a) and (11.92b). 

11.15. Determine the effect of an external magnetic field on the energy levels of the 
n = 2 states of hydrogen when the applied magnetic field D has a magnitude much 
greater than 10 4 gauss, in which case the spin-orbit interaction may be neglected as 
a first approximation. This is the Paschen-Bach effect. 

The following four problems provide us with some techniques for evaluating some 
of the hydrogen-atom expectation values that we have used in this chapter. These 
“tricks” are given by R. Shankar, Principles of Quantum Mechanics. 


11.16. In order to evaluate (1/r) consider y/r as a perturbation for the hydrogenic 
atom, where we can think of y as some “small” constant. The first-order shift in the 
energy is given by 


£ (,) - 
n 



which is clearly linear in y. 

(a) First show that the exact eigenvalues are given by 

_ p{Ze 2 -y) 2 

" 2h 2 n 2 

Suggestion: Examine (10.32). 

(b) Since E n — £ 40) -f FfJ 1 + E ,2> -1- • • •. w'e can obtain either by explicitly 
finding the contribution to E n that is linear in y, or. more generally, noting 
that 


E 


<» = y 

n 1 



r =o 


since E^ y> is of course independent of y and the higher order terms in the 
expansion are at least of order y 2 . In this way show that 
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/1 \ _ pZe 2 _ Z/xca 

h 2 n 2 hn 2 


Z 

a 0 n 2 


11.17. 

(a) Treat yp 2 /2fi as a perturbation for the hydrogen-like atom and. using the 
techniques of Problem 11.16, show that 



lxc 2 Z 2 a 2 

Tji 2 


(b) Use the results of (a) and Problem 11.16 to show that for the hydrogen atom 

(K) = -'-{V) 

in agreement with the virial theorem in quantum mechanics. .See Prob¬ 
lem 10.13. 


11.18. In order to evaluate ( \/r 2 ) take y/r 2 as a perturbation for the hydrogen atom. 
Here again we can obtain the exact solution since the perturbation modifies the 
centrifugal potential in (10.12) as follows: 

/(/+ I )h 2 | y _ Hl'+DH 2 

2pr 2 r 2 ~ 2pr 2 

Thus the exact energy is given by 

_ pc 2 Z 2 a 2 
~ ~ 2(n r + /'+ l) 2 

Show that 



11.19. We cannot use the techniques of Problem 11.16 to evaluate ( \/r •’). since there 
is no term in the Coulomb Hamiltonian that involves 1/r 3 . However, use the fact that 


(n, I. p r ] |n, L m) =0 

where p r is the radial momentum operator introduced in (9.92) and H 0 is the 
unperturbed hydrogenic Hamiltonian (11.65), to show that 

(!) = z ll) = __ 

\r 3 ln,i,m + 1 ) \r 2 ln,l,m a^n 3 l(l + 1 )(/ + 1 ) 
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CHAPTER 12 


Identical Particles 


* 


In any discussion of mulliclectron atoms, molecules, solids, nuclei, or elementary 
particles, we face systems that involve identical particles. As we will discuss in 
this chapter, the truly indistinguishable nature of identical particles within quantum 
mechanics has profound consequences for the way the physical world behaves. 


12.1 Indistinguishable Particles in Quantum Mechanics 


As far as we can tell, all electrons are identical. They all have the same mass, the 
same charge, and the same intrinsic spin. There are no additional properties, such 
as color, that allow us to distinguish one electron from another. Yet within classical 
mechanics, identical particles are, in principle, distinguishable. You don't have to 
paint one of them red and one of them green to be able to tell two identical particles 
apart. If at some initial time you specify the positions and the velocities (r j . v j ) and 
(r 2 , v 2 ) of two interacting panicles, you can calculate their positions and velocities 
at all later times. The panicles follow well-defined trajectories, so you don't need 
to actually observe the particles to be sure which is which when you find one of the 
particles at a later time. In any case, within classical theory 7 you w ould, in principle, 
be permitted to make measurements of the particles’ positions and velocities without 
influencing their motions so that you could actually follow the trajectories of the two 
particles and thus keep track of them. 

Life in the real world is different, at least on the microscopic level. As we have 
seen in Chapter 8, in many microscopic situations there is no well-defined trajectory 
that a particle follows. The particle has amplitudes to take all paths. Or in the 
language of wave functions, each of the particles may have an amplitude to be at 
a variety of overlapping positions, as indicated in Fig. 12.1, so we cannot be sure 
which of the particles we have found if we make a subsequent measurement of the 
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Figure 12.1 A schematic diagram indicating the po¬ 
sition probability distribution for two particles. Since 
these distributions overlap, there is no way to be sure 
which particle we have detected if we make a measure¬ 
ment of the particle’s position and the two particles are 
identical. 

•» 

particle’s position. Moreover, any attempt to keep track of the particle by measuring 
its position is bound to change fundamentally the particle’s quantum state. 

With these considerations in mind, let’s see what types of states are allowed for 
a pair of identical particles. We specify a two-particle state by 

\a,b) = \a) l ®\b) 2 (12.1) 

where a single-particle state such as |a)j specifies the state of particle 1 and \b) 2 
specifies the state of particle 2. 

We introduce the exchange operator P 12 , which is defined by 

P I2 |a, b) = \b, a) (12.2a) 


or 


A 2 (|a)l®l*)2) = l*)i®|fl)2 (12.2b) 

As an example, the effect of the exchange operator on the state |r,, +z), ig> |r 2 , — z} 2 , 
which has particle 1 at position r, with S z = h/2 and particle 2 at position r 2 with 
S z = —h/2, is to produce the state |r 2 , —z), <gi |r,, +z) 2 which has particle 1 at 
position r 2 withS. — —h/2 and particle 2 at position r, with S z = H /2 (see Fig. 12.2). 
The exchange operator interchanges the particles, switching the subscript labels 1 
and 2 on the states. Since for any physical state of two identical particles we cannot 
tell if we have exchanged the particles, the “exchanged” state must be the same 
physical state and therefore can differ from the initial state by at most an overall 
phase: 

PnW) = j t '\*) = W) (12.3) 

Thus the allowed physical states are eigenstates of the exchange operator with 
eigenvalue X. Applying the exchange operator twice yields the identity operator. 
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Figure 12.2 The effect of the exchange operator on a state 
of two spin-4 particles as shown in (a) is to exchange both 
the positions and the spins (indicated by the double arrow ). 
as shown in (b). '' 


Therefore 


P; 2 W) = \ 2 W) = W) (12.4) 

which shows X 2 = 1, or A. = ±1 are the two allowed eigenvalues. 1 

Clearly, if the two identical particles are each in the same state |a), they are in an 
eigenstate of the exchange operator with eigenvalue X = 1: 

P ]2 \a, a) = \a. a) (12.5) 


indicating that the stale is symmetric under exchange. If/? ^ a. we can find the linear 
combinations of the two states |u. b) and |Z?, a) that are eigenstates of the exchange 
operator. The matrix representation of the exchange operator using these states as a 
basis is given by 

p b\P l2 \a,b) (a, b\P l2 \b, a) / 0 I\ 

\(b,a\P r _\a,b) (b, a\P l2 \b, a) )~ V 1 0/ 

where we have used action of the exchange operator as given in (12.2) and assumed 
that the two states |«. b) and |/?. a) are normalizable and orthogonal. Thus the 
condition that the eigenvalue equation (12.3) has a nontrivial solution is given by 


-X 

1 



(12.7) 


1 There are exceptions to tills rule in two-dimensional systems. See the article “Anyons” by 
F. Wilcxek, Scientific American, May 1991. p. 58. 
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which also yields A = ±1 as before. Substituting the eigenvalues into the eigenvalue 
equation, we find that the eigenstates corresponding to these eigenvalues are given by 


\rlf s ) = ~\o,b) + ~\b,a) A = 1 
\rlr A ) = ~\a,b)~^=\b,a) A = -l 


(12.8a) 

(12.8b) 


where the subscripts S and A indicate that these two eigenstates are symmetric and 
antisymmetric, respectively, under the interchange of the two particles. Notice that 
two identical particles must be in either the state \\j/ s ) or the state but they 
cannot be in a superposition of these states, for then exchanging the two particles 
does not lead to a state that differs from the initial state by an overall phase: 


^12 (csl'I's) + CaWa)) = csWs) ~ c a\^a) (12.9) 


Thus the particles must make a choice between \t/r s ) and | \j/ A ). In fact, it turns out 
that Nature makes the choice for them in a strikingly comprehensive way: 

Particles with an integral intrinsic spin, .v = 0, 1, 2, ..., are found to be only in 
symmetric states and are called bosons; these particles obey Bose-Einstein statis¬ 
tics. 2 Examples of such particles include fundamental elementary particles such as 
photons, gluons, the VT ± and Z 0 intermediate vector bosons, and the graviton— 
particles that mediate the electromagnetic, strong, weak, and gravitational interac¬ 
tions, respectively—as well as composite particles such as pions and nuclei such 
as 4 He. 

Particles with half-integral intrinsic spin, s = §.are found to be only 

in antisymmetric states and are called fermions; these particles obey Fcrmi-Dirac 
statistics. Examples of such particles include fundamental elementary particles such 
as electrons, muons, neutrinos, and quarks, as well as composite particles such as 
protons, neutrons, and nuclei such as 3 He. 


2 The symmetry requirement on the allowed quantum states of identical bosons leads to a 
statistical distribution function for an ensemble of N identical bosons in thermal equilibrium at a 
temperature T that is different from the classical Boltzmann distribution function. In particular, 
the number of bosons in a particular state with energy E is given by 

" {E> = eOeE/W - 1 

where the value of a is chosen so as to ensure that the total number of particles is indeed /V. On 
the other hand, the antisymmetry requirement on the allowed quantum states for an ensemble on 
N identical fermions leads to the distribution function 

niE) = e* e E/k b T + J 

Note: n(E ) can be very large for the Bose-Einstein distribution, while for the Fermi-Dirac distri¬ 
bution n (E) < I. For a derivation of these quantum distribution functions, see. for example, F. Rcif. 
Fundamentals of Statistical and Thermal Physics, McGraw-Hill, New York, 1965, Chapter 9. 
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At the level of nonrelativistic quantum mechanics, this relationship between the 
intrinsic spin of the particle and the exchange symmetry of the quantum state is a law 
of nature—often referred to as the spin-statistics theorem—that we must accept as 
a given. We can take comfort in the fact that this spin-statistics theorem can be shown 
to be a necessary 1 consequence of relativistic quantum field theory.' 1 In Chapter 14 
we consider the fully relativistic quantum field theory for photons, and we can then 
see why, as an example, photons must indeed be bosons. 

Finally, as we saw in (12.5), if tw'o identical particles are in the same state, the 
state is necessarily symmetric under interchange, and therefore such a state cannot 
be occupied by fermions. Thus two electrons, two spin-| particles, cannot occupy 
the same state—a statement of the Pauli exclusion principle. We will see how 
this principle plays a fundamental role in determining the structure of atoms and 
molecules. 


EXAMPLE 12.1 If |p} and |q) are photon states with momentum p and q, 
respectively, and | R) and | L) arc right-circularly and left-circularly polarized 
photon states, respectively, then which of the following states are possible 
states of two photons? 

(a) (^lp>,lq>2 + TflqhlPh) (j||ff)iH->2 - Tfl'-hltfh) 

( b ) (^lp)ilq>2 - Tflqhlph) ltf)iltf>2 

(c) (^lp)ilq)2 + ^lq>ilp> 2 ) (^|/?>,|/-) 2 + ^iq 1 |/?>2) 

(d) lp)ilq) 2 l*>iK>2 

(e) (^|p>,|q } 2 - ^|q)ilp) 2 ) (^|/?>il /-) 2 - 

SOLUTION Since photons are spin-1 particles, the two-photon state must 
be symmetric under exchange. Only states (c) and (e) have the property 
that P 12 \tfr) — |i j>). In (c) the two-photon momentum and the two-photon 
polarization states are both symmetric under exchange, while in (e) they are 
both antisymmetric under exchange. In either case these states are symmetric 
under exchange when all the attributes of the photons are swapped, as is 
required for a state consisting of identical bosons. In contrast, the states (a) 
and (b) are overall antisymmetric under exchange and the state (d) does not 
have any definite exchange symmetry. Thus the states (a), (b), and (d) are 
not possible tw'o-photon states. 


A comprehensive but advanced discussion is given by R. Streater and A. S. Wightman, PCT, 
Spin and Statistics, and All That, W. A. Benjamin, New York, 1964. 
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12.2 The Helium Atom 


As an interesting example of a system containing two identical fermions, we start 
with the Hamiltonian 


*1 *■ 



2m e 2 m e 


Ze 2 Ze 2 | e 2 

l*i I l* 2 l |rj-r 2 | 


( 12 . 10 ) 


which includes the principal electrostatic interactions between the nucleus and the 
electrons in the helium atom when we take Z = 2 for the charge on the nucleus and 
ignore the contribution of the kinetic energy of the nucleus to the energy of the atom 
(see Fig. 12.3). One approach is to treat 


h = iL _ + il'_ 

0 2m e |rj 2 m e |r 2 | 


( 12 . 11 ) 


as the unperturbed Hamiltonian and the Coulomb energy of repulsion of the two 
electrons 


#1 = 


*2l 


( 12 . 12 ) 


as a perturbation. Although we do not expect the perturbation to be much smaller 
than the interaction of the electrons with the nucleus, nonetheless breaking the 
Hamiltonian into (12.11) and (12.12) is an attractive option. Since r, and pj commute 
with r 2 and p 2 , the unperturbed Hamiltonian is just the sum of two independent 
Coulomb Hamiltonians. Thus we can express the eigenstates of the unperturbed 
Hamiltonian as simultaneous hydrogenic eigenstates \n u <8> |n 2 , /?, w 2 ) 2 , 

which we know well. On the other hand, the full Hamiltonian (12.10) is just too 
complicated to solve directly, and so we must resort to approximation methods. 
Moreover, this perturbative approach has much to teach us, both qualitatively and 
quantitatively, about the important effects of the identical nature of the electrons on 
the spectrum of the helium atom. 


THE GROUND STATE 

Let’s start with the ground state of H^. in which each of the particles is in the lowest 
energy state 


| 1 , 0 , 0 >! < S > | 1 , 0 , 0) 2 = | 1 . 0 . 0 >,| 1 , 0 , 0) 2 


( 12 . 13 ) 


e 



Figure 12.3 The positions r t and r 2 of the two electrons with 
respect to the nucleus in the helium atom. 
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where on the right-hand side we have dispensed with the direct-product symbol, just 
as we did in our earlier discussion of two-particle spin states in Chapter 5. Although 
this state is clearly symmetric under interchange of the two particles, we have not 
yet specified the spin states of the two electrons. There is only one way to do this 
and make the total state of the two particles antisymmetric; namely, the spin state 
of the two particles must be 


y=(|+z)]|—z) 2 — I — z ) 11+2)2) 


(12.14) 


which is antisymmetric under exchange of the spins of the two particles. Thus the 
ground state of the two-electron system is given by 


I Is, Is) = |1, 0, 0),|1, 0, 0) 2 ^=(|+z),|—z) 2 - |-z), + |z) 2 ) 

V2 


(12.15) 


where the label l.v. Is on the overall ket indicates that each of the electrons is in the 
n = 1. / = 0 state of the hydrogenic Hamiltonian. Recall from (5.31) that the spin 
state (12.14) is just the state 


|0, 0) = -J=(|+z) 1 |-z) 2 - |-z)i|+z> 2 ) 


(12.16) 


where 


(S x -+- S 2 ) 2 |0, 0) =0 (12.17a) 

(Sj z + 5 2 z )| 0. 0) =0 (12.17b) 

The ground state of the two-electron system must be a total-spin-0 state, even though 
the Hamiltonian (12.10) itself docs not depend on spin. The spectroscopic notation 4 
for this state is 'S 0 . From (10.34), the energy of the unperturbed ground state is 

E®, = 2 (-±m e c 2 Z 2 a 2 ) = -8(13.6 eV) = -108.8 eV (12.18) 

where we have put Z = 2 to determine a numerical value. 

The first-order shift in the ground-state energy is given by 

= (I*, ls\ e \ d ig, Is) (12.19) 

l r l _ r 2l 


4 It is conventional to label the total spin .S', the total orbital angular momentum L, and the total 
angular momentum J of atomic states in the form 2s ~ i L J , where the orbital angular momentum 
label is given in the usual spectroscopic notation: L = 0 is 5, L = 1 is P, L = 2 is D, and so on. 
Note the 25 -b 1 superscript gives the spin multiplicity for each state: 5 = 0 corresponds to a single 
total-spin state, while for 5 = 1 there is the usual triplet of spin-1 states. For atoms with more than 
two electrons, the values of total spin will, of course, differ from these values. 
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Since the perturbing Hamiltonian depends on the position operators r, and f 2 , it 
is natural to evaluate (12.19) in position space. For the ground state, the spin- 
independent part of the state is symmetric under exchange of the two particles, 
and we need to ask how we express a symmetric spatial state \\j/ s ) of two identical 
particles in position space. We first write 

Ms) = \ J d\ d \ 2 r 2 > + ^|l r 2 ’ r t>) 

x ^ J=(r, t r 2 l ts) + ^( r 2 - r il w) (12.20) 

where we have been careful to use the appropriate two-particle position states 

y=l r l- r 2) + yf I*" 2 ’ r l) = ^l r lhl r 2)2 + y^l r 2)ll r l)2 (12.21) 

for identical particles that are symmetric under interchange of the two identical 
particles. The factor of ± in front of the right-hand side of (12.20) is necessary 
because when we integrate over all values of rj and r 2 , we count each two-particle 
position state twice. However, for a symmetric state 

<rj.r 2 |* s ) = {r 2 .r,|* s ) (12.22) 


and thus 


l*5> = \ j d \ d * r 2 (Iri, r 2 )(r,. r 2 \rj/ s ) + |r 2 , r,}^. r 2 |^ s » 

= J </ 3 r, rf-Vilrj, r 2 )(r,, r 2 \^ s ) (12.23) 

where in the last step we have interchanged the dummy variables r, <-> r 2 in the 
second term and then taken advantage of (12.22). Notice that the result (12.23) is 
exactly what we would have obtained had we just inserted two-particle position states 
|r t , r 2 ) for nonidentical particles. You can verify that a similar result holds for an 
antisymmetric spatial state | \j/ A ). See Problem 12.1. 

Using (12.23), we now see that (12.19) becomes 

EL\ = ff d \ A>l( r tll- 0. 0)i 2 |(r 2 |l. 0, 0)| 2 -———- (12.24) 

JJ |Tj —r 2 | 

Since | (rj| 1. 0, 0)1 2 is the probability density of finding an electron at r,, 

tf|(r 1 |l,0,0>| 2 = /»(r 1 ) (12.25) 
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has the form of the charge density due to this electron. The electric potential at r? 
produced by this charge density is 

[ A, —- r|) (12.26) 

J V,-r 2 | 

Therefore the energy of interaction of this charge density with the charge density 


e|(r 2 |l,0.0)| 2 = p(r 2 ) (12.27) 

of the other electron is the usual result from electrostatics 

E uu= f( * VS P , (r ' )P(l : - ) (12.28) 

’ JJ l r i — r 2l 

which gives us a nice, physical way to interpret (12.24). 

Using the wave function (r| 1. 0, 0) = 0 F 0 0 = (M \f^)(2/%) ill e~ Zr/ao , with 

K lt0 from (10.43) and y 0 0 = l/\/4 tt, the energy shift (12.24) becomes 



Evaluating the integrals in (12.29) is relatively straightforward, since there is no 
angular dependence in the ground-state wave function. In particular, we have the 
freedom to choose the z axis of the dummy variable r | to point in the direction of r 2 
so that r, • r 2 = r { r 2 cos 0 2 , giving us an exact differential for the (A integral. Then 


[ dSl-, ---= f d<pi f sin 0-,dd 2 ^ -s- 

J 2 |r, — r 2 | Jo Jo ‘ (r 2 + r 2 -2 

=^[( r ' +r =- 2r,riCOse2 )' 2 ]l 


r,r 2 cos^ 2 )’/ 2 

&2=7T 


2tt , . .. 

--(r, +r 2 - ki - r 2 |) 

r l r 2 


(12.30) 


Since there is no angular dependence left in the integral (12.29), we can use 
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J dQ, = 4n to do the remaining angular integrals. Finally, doing the radial integral, 
we obtain 


£ <i) _ 


( y \ $ r 00 pOO 

— ) Se 2 dr, r,e~ 2Zr,/a(> dr 2 r 2 e~ 2Zn,a °(r, + r 2 - |r, - r 2 |) 
%/ J Jo Jo 

( — ) 8 e 2 \^ dr, r,e- 1Zr ^ 

\ao/ J Jo 

x (lj ' dr 2 r 2 j- 2Zri - lao + 2r, ^ dr 2 r 2 e- r/r '- /a <^j 


= -Zm e c 2 a 2 = 34.0 eV 
8 


(12.31) 


where we have replaced the Bohr radius with a 0 = H/m e ca, ignoring reduced mass 
effects, and put Z = 2 to obtain a numerical result. 

Adding the first-order shift in the energy to the unperturbed value, we find that 
the ground-state energy of the helium atom is given by 


= E ( y U + = -108.8 eV + 34.0 eV = -74.8 eV (12.32) 


This is to be compared with experimental value = —79.0 eV. Thus there is 
a sizable discrepancy between our perturbative calculation and experiment. This is 
actually not surprising, since we have no reason to expect that the Coulomb repulsion 
between the two electrons in the helium atom, which we treated as a perturbation, 
should be much less than the energy of attraction of the electrons to the nucleus. 
In fact, considering the size of the first-order shift (12.31), we haven’t done badly 
at all in this primitive attempt at a perturbative solution. After a discussion of the 
excited states, we will examine an alternative method of determining the energies 
with higher accuracy. 


THE EXCITED STATES 

Let’s turn our attention to the first excited states, in which one of the particles is 
in the state 11, 0, 0) while the other is in one of the four states |2, /. m), which are 
all degenerate eigenstates of the single-particle hydrogcnic Hamiltonian. Taking the 
spin states of the two electrons into account, we can construct a number of two- 
electron states of the form (12.8b) that are antisymmetric under exchange of the two 
particles: 


— (|1, 0. 0, +z),|2, /, m, +z) 2 - |2, /, m, +z),| I, 0, 0, +z) 2 ) 
v2 


= -r=(|l, 0, 0),|2, /, m) 2 - |2, /, m),11, 0, 0) 2 )|+z),|+z> 2 (12.33a) 


Page 445 (metric system) 



12.2 The Helium Atom | 429 


1 


V2 


(|1. o, 0, -z),|2, /. m, -z) 2 - |2, l, m, —z),| 1, 0. 0. -z) 2 ) 


= 0, 0),|2, /, m) 2 - |2, m)!|l, 0, 0> 2 )|-z>,| 


~ z )2 


(12.33b) 


-pdl. 0. 0, +z)||2, /, m. -z), - |2, /, m, -z)j|l, 0. 0, +z)->) 
V2 


— 7 =(| 1 . 0. 0, -z>!|2, /, m, +z> 2 - |2, /, m, +z),|l. 0, 0, -z> 2 ) 

v2 


(12.33c) 

(12.33d) 


The unperturbed energy of each of these states is given by 


E { 0) 

c li, 2jor2p 


2 7 2 2 
— m.cZct 

2 



= -68.0 eV 


(12.34) 


where the subscript label shows that one of the electrons has n = 1, / = 0, while the 
other has n = 2 and either / = 0 or / = 1. As before, for helium we have set Z = 2 
to obtain a numerical value. 

Unlike (12.33a) and (12.33b), which have total-spin states 11, 1) and |1. —1) re¬ 
spectively, the states (12.33c) and (12.33d) are not eigenstates of total spin. However, 
we have not yet taken into account the effect of the perturbing Hamiltonian (12.12). 
Since the states (12.33) are all degenerate, we must find the proper linear combina¬ 
tions that diagonalize H x . First, note that since the two electrons are identical, 
as well as H 0 must commute with the exchange operator. Although we are required 
to form total eigenstates of the two electrons that are antisymmetric with respect 
to exchange, we have not taken advantage of the full exchange symmetry of the 
Hamiltonian. In particular, we can express the exchange operator 


P l2 =P^ ce P^ a (12.35) 

where the operator Pp‘ n exchanges the spin slates of the two electrons and Pp ace 
exchanges the spatial states. Since 

[H b P,T CC J = [H u Pjf"] = 0 (12.36) 


we can diagonalize the interacting Hamiltonian H { by choosing states that are 
eigenstates of both Pp in and P p 3CC . provided we are careful to choose states that 
are overall antisymmetric under complete exchange. 

The states (12.33a) and (12.33b) already satisfy this requirement. They are 
both symmetric under exchange of the spins of the electrons and antisymmetric 
under exchange of the spatial states. From the states (12.33c) and (12.33d) we can 
choose two combinations that are completely antisymmetric under exchange. One 
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combination has a symmetric spin state and an antisymmetric spatial state 


-4(11. 0. 0),|2, /. m) 2 - |2, /, m),|l, 0. 0) 2 )-^(|+z> 1 |-z> 2 + |-z) 1 |+z) 2 ) 
>/2 v2 


(12.37) 


here the spin state is just the total-spin-1 state 11, 0). and the other has an antisym¬ 
metric spin state and a symmetric spatial state 


— 7=(.\ 1. 0, 0)^2, /, m) 2 + |2, /, m)i|l, 0, 0) 2 )— t=(|4-z)i|— z) 2 — |— z)il+z) 2 ) 
V2 v2 


(12.38) 


where the spin stale is the total-spin-0 state |0, 0). We can therefore condense our 
notation and express the excited states in the form 


—(|l. 0. 0)i|2, /, in) 2 - |2, l, m),|l, 0, 0) 2 )|1, m,> 
V2 

with m, taking on the values 1,0, and —1. and 


— (|1, 0, 0),|2. /. ro> 2 + 12, /, w)!11, 0. 0) 2 )|0. 0) 


(12.39) 


(12.40) 


We also might have been led to select these particular combinations of states by 
noting that 


|tf,S, +S 2 ] = 0 (12.41) 

and thus we can find eigenstates of the Hamiltonian that are eigenstates of total spin. 

What is the effect of the perturbing Hamiltonian on the energy of these states? At 
first glance, the problem of evaluating the first-order shifts still seems to be a large 
one, since there are four different degenerate spatial states (/ = 0. m = 0 and / = 1, 
m — ± 1 and 0) for each total-spin state. However, there is an additional symmetry 
of H [ that wc have not yet utilized. Notice that if we rotate the positions r, and r 2 
of both particles in (12.12). H l doesn’t change. The generator of position rotations 
for the two-electron system is just the total orbital angular momentum operator 

L = r, x p,+r 2 x p 2 (12.42) 

Thus the perturbing Hamiltonian commutes with the total orbital angular momen¬ 
tum. Since one of the particles in the states (12.39) and (12.40) is in a state with zero 
orbital angular momentum, these stales are already eigenstates with a total orbital 
angular momentum I and z-component m. Therefore we can calculate the first-order 
shift simply as 
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£ (1) = 0, 0| 2 (2, /, ml ± ,<2, /, m| 2 (I, 0, 0|) — 


2 .*.. l^i — r 2 l 

x (|1, 0, 0)j|2, /, m >2 ± |2, l, m)|| 1, 0, 0) 2 ) (12.43) 

Evaluating this expectation value in position space, we find 


>l d \ 2 ((1, 0, 0| ri ><2, /, m|r 2 ) ± (1. 0, 0|r 2 )<2. /. m|r,»- 


-UI* 

x«r,| 1, 0, 0)(r 2 |2, /, m) ± (r,|2 t l, m)(r 2 |l, 0. 0)) 

= JJ d\ d\ |(1, 0. 0|r,>| 2 |<2, /. m|r 2 )| 2 


|r, - r 2 | 


l r l ~ r 2l 


// 


cfVj d i r 2 (1, 0, 0|r 1 )(2, /, m|r 2 ) 


r i - r 2 | 


(r,|2, /, m)(r 2 |l, 0, 0) 


(12.44) 


where in the last step we have made use of the symmetry of the perturbation under 
r, r 2 . Thus we can express the first-order shift in the energy of the first excited 
states in the form 


E m = J±K (12.45) 

Notice that the + and — signs in (12.44) are correlated with the value of the total spin 
so that the two-particle state is antisymmetric under exchange. The J term, which 
is manifestly positive, is similar to the expression (12.24) that we obtained when we 
calculated the shift of the ground-state energy. As in (12.28). we can describe this 
term as the charge density of one of the electrons interacting with the charge density 
of the other. 

The K term, however, is pure quantum mechanics. There is no classical inter¬ 
pretation that we can assign to this term. It arises, as we have seen, because of the 
identical nature of the particles. We can argue that K must be positive. Note that if 
we put r, = r 2 in the wave functions in the first line of (12.44). we find that the anti¬ 
symmetric wave functions vanish, while the symmetric wave functions add together 
constructively. Thus the electrons in the antisymmetric spatial state tend to avoid 
each other in space, which should lower their energy due to Coulomb repulsion rel¬ 
ative to the electrons in the symmetric spatial state, in which the electrons prefer to 
be close together. Thus the existence of such an exchange term produces an energy 
shift of the total-spin states: the energy of the triplet of spin-1 states is shifted by 
J — K, while the singlet spin-0 slate is shifted in energy by J + K. Provided K is 
positive, the energy of the spin-1 states will be lower than the spin-0 state, as we 
have argued physically should be true. 
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Figure 12.4 An energy-level diagram of 
the first excited states of helium. 


If we evaluate the integrals in (12.44), we find 5 

= J u,2s ± K\s,2s = 11-4 eV ± 1.2 eV (12.46a) 

and 

E\" 2p = J uap ± K U i p = 13.2 eV ± 0.9 eV (12.46b) 

Adding these first-order corrections to (12.34), we obtain —56.6 cV ± 1.2 eV for the 
Is, 2s states and —54.8eV ± 0.9 eV for the Is, 2 p stales. The observed values, shown 
in Fig. 12.4, are —58.8 eV ± 0.4 eV for the Is, 2s states and —57.9 eV ± 0.1 eV 
for the Is, 2 p states. Thus there is almost a 1-eV energy difference between the 
3 S | and ’So states. Not surprisingly, as for the ground state, the agreement between 
our first-order perturbative results and experiment is not excellent. Nonetheless, our 
results show two striking features: not only have spin-dependent energy splittings 
been generated from a Hamiltonian that did not involve the spins of the particles at 
all, but the magnitude of this triplet-singlet splitting is much larger than is generated 
from the spin-spin interactions of the magnetic moments of the two electrons (see 
Problem 12.3). A similar mechanism is presumably responsible for the large spin- 
spin interaction that aligns the spins in a ferromagnet. However, it is much more 
difficult to calculate these effects for ferromagnetic materials. 


THE VARIATIONAL METHOD 

In practice, detailed calculations of the energy levels of helium are carried out using 
the variational method. This is a simple but powerful technique that can be used 


5 See J. L. Powell and B. Crasemann, Quantum Mechanics, Addison-Wesley, Reading. MA. 
1961, p. 457. 
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in a variety of problems. We start with the expectation value of the energy in an 
arbitrary state | \j/)\ 

(E) = (r//\H\x/,) (12.47) 


where we have assumed that — 1. Although in practice we are not able to 

determine the exact eigenvalues £„ and corresponding eigenstates ! E n ), in principle 
we can express ]%!/) as the superposition 

\x/,) = J2c n \E n ) (12.48) 

n 

Then 

{£) = Y, l c »l l£ n > E K\ 2 ^ = E q (12.49) 

n n 

where we have assumed that E„ > £ 0 , with £ 0 the exact ground-state energy. Thus 
for any state |i//) 

E 0 <W\m) (12.50) 


The key to the variational method is to choose a trial state a 2 , a 3 ,. ..)), which 

depends on parameters o'), a 2 , 0 : 3 , ..., and then vary the parameters to minimize 
{£) for this state. In this way we can zero in on the ground-state energy. 

We now use the variational method to determine the ground-state energy of 
helium. In choosing our trial state, we must keep in mind two goals: we want to 
pick a state that is not too far away from the exact state and one for which we can 
actually evaluate (£). For helium, a good starting choice is 

W> = |l,0 f 0(Z)>,|l,0,0(Z))2 (12.51) 


where the state 11, 0, 0(Z)) is the single-particle ground state of a hydrogenic atom 
with charge Z, which we will take as the variational parameter. In position space 


(r|1, 0, 0(Z)) = —j= 
s/n 



(12.52) 


In evaluating (£) = (\J/ \H |^). it is convenient to group the terms in the Hamiltonian 
as follows: 


at 

PI _| 

. pi 

N 

tO 

-7 T 2 

Ze~ e L 

_j 

2 m e 

2 m e 

<u 

|r 2 l ’ If, — r 2 l 


/v 7 

P7 

Ze 2 
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Ze 2 ' 

+ 

"(Z — Z)e 2 , (Z-Z)e 2 

2m e 

Ifil 

2 m c 

|f 2 l 

1 

jji> 

10 


(12.53) 
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w here the term in the first brackets is just the sum of two hydrogenic Hamiltonians 
with charge Z, whose expectation value is straightforward to determine for the state 
(12.52). 6 Then 


Setting 


{£) = |m,cV(-2Z 2 + 4Z(Z - Z) + f Z) 
= \m e c 2 a 2 (2Z 2 - 4ZZ + |Z) 


m=o 

9Z 


we find the value of Z that minimizes the energy: 


Z = 7 — — 
^ ^ 16 


Substituting this result into (£), we obtain 

£ 0 < -\m e c 2 a 2 \l{Z - ^) 2 ] = -77.4 eV 


(12.54) 


(12.55) 


(12.56) 


(12.57) 


which is much closer to the observed value £ exp = —79.0 eV than our earlier first- 
order perturbative result (12.32). Notice that (12.56) has the simple interpretation 
that each electron is partially screened from the nucleus by the presence of the other 
electron in the atom. As before, we have put Z = 2 in (12.57) to obtain the numerical 
result. 

The variational method l or the ground state of helium can be improved with more 
complicated trial wave functions. In fact, Pekeris has used a wave function involving 
1075 parameters to obtain numerically an estimate for the ground-state energy that 
agrees with the experimental results to within the experimental errors. 7 In general, 
the reason for working so hard to obtain such good agreement is that then one can 
use the wave function that has been deduced to calculate other quantities, such as 
the lifetimes of excited states. 

We can also estimate the energies of these excited states and obtain the corre¬ 
sponding wave functions using the variational method. Note in (12.48) that if we 
choose a trial state \ijr} that does not have any of the true ground state in it, that is, 
if it is orthogonal to the ground state: 


<W> = 0 


(12.58) 


6 Note: The Hamiltonian itself does not depend on Z. We have added and subtracted terms 
such as Ze*/lf|l to make the evaluation of (i/\H\i!/) as easy as possible. For the variational method 
to work, the variational parameter cannot be a parameter appearing in H. 

7 C. L. Pekeris, Phys. Rev. 115, 1216 (1959). 
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then the expectation value of the energy in this trial state satislies (£) > E h and 
we can obtain an estimate of the energy E\ by minimizing this expectation value. 
Sometimes it is easy to satisfy the condition (12.58). For example, if the excited state 
has nonzero orbital angular momentum, choosing a trial wave function involving the 
appropriate spherical harmonic for the angular dependence automatically guarantees 
that this state is orthogonal to the ground state. Or, if necessary, we can determine 
an excited-energy trial state that is orthogonal to the trial state \\j/) that we obtained 
from estimating the ground state energy. We choose a new trial state \(p) and then 
explicitly construct a state that is orthogonal to the trial ground state jt/r): 


, I <P) - I ^){^\<P) 
I <P) -* , 


(12.59) 


'i 

However, since we are using trial state \\j/) rather than the true ground state in this 
superposition, we should not expect minimizing the expectation value of the energy 
in this state to yield an upper bound for £j. 


EXAMPLE 12.2 Use the variational method to estimate the ground-state 
energy of the hydrogen atom. 

SOLUTION Of course, we already know the ground-state energy of hy¬ 
drogen. Our goal here is to show how to implement the variational method 
in a step-by-step approach. The Hamiltonian for the hydrogen atom is 



|f| 


As a first step, we need to make a judicious choice of the trial w ave function. 
Since we are looking for the ground state, we need to choose a spherically 
symmetric wave function, since we know the ground state has / = 0. More¬ 
over, for a spherically symmetric potential energy, we know that for small 
r the radial w'ave function R(r) behaves as r l . Finally, since our goal is to 
carry out the calculation analytically, we want to choose a wave function for 
which we can actually do the integrals needed to evaluate (£). There are 
two likely possibilities for the trial wave function, a simple exponential and 
a Gaussian. Here we will use the trial wave function 

R(r) = Ne~ br 

where N is the normalization constant. 

First, we determine N: 

pOO poo 1 

1 = J Q r 2 dr\R(r)\ 2 =\N\ 2 j r 2 dr e~ 2hr = \N\ 2 — 
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Choosing 

N - 2b i/2 

the full trial wave function is therefore 


1 b i/2 h 

* = R(r)Y M = —= R(r ) = —e~ br 
s/4n sjn 


Consequently, 

(E) = (x/f\H\ir) 


h 2 b 2 . hrb e 2 \ 
Hr r ) 


= 4b 3 | 

poo 

r 2 dre 

~ br ( 

J 

0 

V 



h 2 h 2 

= 4h~ 

/ dr - 

J 

o V 

2/i 

hrb 2 

-e 2 b 


' 2 n 




-hr 


»-2 hr 


Minimizing the expectation value of the energy: 

d(E) _ fft 
db n 

leading to 


**> -**-«*« 0 


b = 


/xe~ 
1V 


Substituting this value for b into the expression for (E), we see that 


r ^ ^ 1 2 2 

En<{t) = - - = - UCCt 

2 n 2 2 

where a = e 2 /(hc) is the fine-structure constant. Since the exact ground-state 
energy of the Hamiltonian is ~nc 2 a 2 / 2, the variational method in this case 
has yielded the exact energy. This is a consequence of our using a trial wave 
function that included the ground-state energy eigenfunction for a particular 
value of the parameter b. If we had used the same trial wave function for the 
three-dimensional harmonic oscillator, the upper limit on the ground-state 
energy would not be the exact eigenvalue. See Problem 12.8. 
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12.3 Multielectron Atoms and the Periodic Table 


Let’s turn our attention to multielectron atoms. Even if we neglect the contribution 
of the kinetic energy of the nucleus (with charge Ze), then, as for helium, the energy 
eigenvalue equation for the Hamiltonian 


"=£ 


JL 

, 2m e 




;=] 


><j 


r, - r : 


(12.60) 


is too complicated to solve exactly. Of course, if we could neglect the contribution 
of the Coulomb repulsion between the individual electrons, the solution would 
be straightforward. The Hamiltonian would then just be the sum of Z individual 
Coulomb Hamiltonians (with charge Ze on the nucleus for each), and the allowed 
eigenstates could be formed from a direct product of these individual Coulomb 
eigenstates, provided we require that the total state, including spin, is antisymmetric 
under exchange of each pair of electrons. However, there is no reason to expect 
that we can treat the Coulomb repulsion of the electrons as a perturbation, as we 
attempted for helium. In particular, the typical distance separating the electrons in 
the atom should be of the same size as their distance from the nucleus, and although 
the size of the mutual interaction between each pair of electrons should be smaller 
than the interaction of the electrons with the nucleus because of the factor of Z in 
the nuclear charge, there are Z(Z — 1)/2 different pairings of the Z electrons to take 
into account. Thus, even for modest Z we should expect the mutual interaction term 
of all the electrons in the atom to be comparable in size to the interaction of these 
electrons with the nucleus. 

We clearly need an alternative way of dealing with the mutual interaction of the 
electrons. One approach, first used by Hartree, is to treat each of the electrons as 
moving independently in a spherically symmetric potential energy V(r ) due to the 
nucleus and the other Z — 1 electrons. This potential energy should have the form 


V(r) = 



— r -» oo 
r 


(12.61) 


because for small r the electron sees only the nucleus while for large r the nucleus 
is shielded by the other Z — 1 electrons. How do we determine the form for this 
potential energy, since it depends on the charge distribution of the electrons, which 
is itself determined by solving the Schrodinger equation? One approach is to guess 
a reasonable form for V(r) and then solve the Schrodinger equation (numerically) 
to determine the wave functions. As in (12.25), we can use these wave functions to 
determine a charge distribution, which can then be used to determine the potential 
energy that each electron experiences. We cm continue with this procedure until we 
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obtain a self-consistent solution—namely, the potential energy that we determine 
from charge distribution of the electrons is the same, to a given accuracy, as the 
potential energy that we used to determine the wave functions that yielded this charge 
distribution. 

We can label the energy eigenstates by the three quantum numbers n. I. and m, 
just as we did for the hydrogen atom. However, since the potential energy does not 
have a simple 1/r dependence, states with a particular n and different / do not have 
the same energy. In particular, states with lower / should have lower energy, since 
the lowering of the centrifugal barrier with lower / permits these wave functions to 
penetrate more deeply inside the charge distribution formed by the other electrons 
and thus “see " at least at small r. the full attractive potential energy of the nucleus 
with its charge Ze. Thus the accidental degeneracy of the hydrogen atom disappears. 
The independence of the energy on m persists, however, since the potential energy 
is taken to be spherically symmetric. 

Let’s discuss the ground-state electronic configuration of the elements, especially 
those of low Z. Start with the first row of the periodic table, showm in Fig. 12.5. 
Hydrogen has n = 1 and / = 0 for the ground state. We call this a 1a electron 
configuration. For helium, as we saw in Section 12.2, we can put the two electrons 
both in the l.v state, which we now denote by l.v 2 as a shorthand notation for 1a 1a. 
Of course, the spin state must be a lotal-spin-singlet state in order to make the total 
state antisymmetric under exchange. These two electrons fill the n = I shell. It takes 
more than 19 eV to excite one of the electrons to the n = 2 level and 24.6 eV to 
ionize the atom by removing one of the electrons. Helium is exceptionally stable 
and, correspondingly, not chemically active. 8 

The next element in the periodic table, lithium, has three electrons, but we cannot 
add this third electron to the l.v level, for then two of the electrons would be in the 
same state, since there are only two possible spin states, spin up and spin down, 
for each of the electrons. Thus the total state could not be completely antisymmetric 
under exchange of any pair of the electrons. One of the electrons must therefore be in 
the next highest energy state, the 2.v slate, which has lower energy than the 2 p state. 
We label this electron configuration by 1a 2 2a. It takes only 5.4 eV to ionize lithium 
and thus lithium is chemically quite active. The ground slate of the next element 
in the periodic table, beryllium, is 1a 2 2a 2 , while boron with five electrons is in the 
l.v 2 2 a- 2 2 p state. The next five elements —C, N, O, F, and Ne — fill up the 2 p level, 
which can accommodate 6 electrons, since for Z = 1, we can have m = 1, 0, and —1 
and two possible intrinsic spin states for each of these orbital states. Figure 12.6 
shows the ionization energy for each of the elements. Notice that as Z increases 
within a shell, the electrons are pulled in toward the nucleus and the ionization energy 


x See also the discussion at the end of Section 12.4. 
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Figure 12.5 The periodic table of the elements, including the electronic configurations. 
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Figure 12.6 The ionization energy of the elements. 


increases. Neon, like helium, has a completely filled shell and a high ionization 
energy of 21.6 eV. The charge density for a closed shell 

l 

£ e\R„j(r)\ 2 \Yi m (0, 0)| 2 (12.62) 

m=—l 

is spherically symmetric since 

T = 02 . 63 ) 

m=—l 4 * 

Thus the electronic charge effectively shields the nuclear charge. Although exciting 
one of the electrons to an excited state would change this situation, the energy gap 
between the n = 3 and n = 2 levels is sufficiently large that the atom has little affinity 
for other electrons and is chemically quite inert. 

The third row of the periodic table starts with sodium. After filling the n = 2 
shell, the eleventh electron must go into the n = 3 level. Again, the 3s level lies 
lower than the 3 p level and thus the electron configuration is l.t 2 2 s~ 2p 6 3,v. Since 
this last electron is in a new shell, it is primarily shielded from the nucleus by the 
inner ten electrons and thus, like lithium, has a low ionization energy (5.1 eV) and 
a high chemical activity. Both lithium and sodium are alkali metals, which are quite 
reactive chemically. As an example, if fluorine is present, sodium sees a natural home 
for its “extra” electron; it can donate it to fluorine completing the n — 2 shell for that 
element. These ions then bind together through electrostatic attraction, forming an 
ionic bond. In moving from sodium to argon along the third row of the periodic table, 
the 5 and the p levels fill up just as they did in going from lithium to neon along the 
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second row. and thus the chemical properties of these elements are strikingly similar 
to the ones above them in the periodic table. 9 In particular, chlorine, like fluorine, 
is a halogen that needs just one electron to complete a shell—the 3p shell in this 
case. Chlorine can take that electron from sodium, producing sodium chloride, or 
ordinary salt. Argon, like neon, has a closed shell and is relatively inert; it is one of 
the noble gases. 

The next row in the periodic table contains some surprises. Instead of fillin g 
the 3 cl level next, as you might expect based on the energy levels of hydrogen, 
the 4s electrons penetrate the electron cloud of the inner electrons and have their 
energy pulled down below that of the 3 d level. In fact, there is very little separation 
between the 4.y and 3d levels. By the time the 3d level is filled with four electrons, 
the interaction between the electrons raises the energy of the 4s level so that it is 
slightly above the 3d level. Thus chromium has an outer shell electron configuration 
of 4,y 1 3d 5 instead of As 2 3d 4 . The next few elements—manganese, iron, cobalt, 
and nickel—have a filled 4s level with electronic configurations ranging from 4s 2 
3c/ 5 to 4s 2 3c/ 8 . The chemical properties of these elements are all similar. Since the 
average radius of the 3d electrons is somewhat less than the 45 electrons, the outer 45 
electrons tend to shield the inner 3d electrons from outside influences. 10 At copper, 
which is 4.V 1 3c/ 10 , the pattern shifts again, with one of the electrons from the 4.v level 
shifting to the 3d level. However, the two configurations 4.v 2 3c/ 9 and 4.v' 3c/ 10 are 
so close together in energy that copper can behave as if it has one or two valence 
electrons depending on its chemical environment. Finally, after both the 45 and 3d 
levels are filled, the 4 p level fills, repeating the pattern of the previous two rows. 

One of the triumphs of quantum mechanics is that it provides us with a detailed 
understanding of the physics responsible for the periodicity in the chemical proper¬ 
ties of the elements. 

12.4 Covalent Bonding 


In our discussion of chemical activity of the elements, we have indicated that certain 
elements can bind ionically together through electrostatic attraction after they have 
transferred an electron from one of the elements to the other. There is another type of 
bond, the covalent bond, in which the elements actually share rather than exchange 
their electrons. This sort of bonding is pure quantum mechanics in action. To see 
how it arises, we first consider as a specific case the positively charged hydrogen 


9 The ionization energies are slightly less, since n = 3 instead of 2. 

1(1 This effect is even more pronounced in the sixth and seventh rows of the periodic table. The 
4/ electrons do not fill up until after the 6s shell. Thus the rare earth elements with Z ranging 
from 57 to 71 have similar chemical properties. Also, the 5/ electrons do not fill until after the 7 s 
shell, leading to very’ similar chemical properties for the actinides, Z = 89 to Z = 103. 
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molecule ion, where a single electron is shared by two protons; then we consider the 
more prototypical case of the hydrogen molecule, where two electrons are shared. We 
will see that the identical nature of the electrons plays a crucial role in this covalent 
bonding. 


THE HYDROGEN MOLECULE ION 

Although molecules, even simple diatomic molecules, are complex systems with 
many degrees of freedom, fortunately there are approximations that we can make 
that make the problem of determining the bound states of molecules a reasonably 
tractable one. In particular, recall from Section 9.7 that the energy of vibration 
of the nuclei of a diatomic molecule is on the order of (m e /m N )^ 2 smaller than 
the electronic energy of the molecule. Thus the typical period of the motion of 
electrons in the molecule is much shorter than that of the nuclei, and we can neglect 
the motion of the nuclei as a first approximation, part of the Bom-Oppenheimer 
approximation. For a diatomic molecule we then consider the behavior of electrons 
under the inlluence of two fixed nuclei separated by a distance R. In fact, we can 
take R as a parameter that also appears in the wave function and that we can 
adjust using the variational method to determine the value of R that minimizes 
the energy of the molecule, thus determining both the energy and the size of the 
molecule. 

A natural trial wave function for the hydrogen molecule ion H+ is determined 
by first considering the lowest energy state of the system when the two protons are 
widely separated. Then there are clearly two possible states: either the electron is 
attached to one of the protons, forming a hydrogen atom in the ground state, or 
the electron is attached to the other proton, again in the ground state of a hydrogen 
atom. These two states are indicated in Fig. 12.7. In terms of the coordinates shown 
in Fig. 12.8, the corresponding position-space wave functions are given by 


<r|l) = 



-|r-R/ 2|/« 0 


<r|2) = 



-|r+R/ 2 |/«o 


(12.64a) 

(12.64b) 


There are, of course, many other possible states of the system that we are neglecting 
in which, for example, the electron is in an excited state of the hydrogen atom. 

What is the proper linear combination of states 11) and |2) to use for the variational 
method? Here, as was the case for the N atom in the ammonia molecule that we 
treated as a two-state system in Chapter 4, there is an amplitude for the electron 
attached to one of the protons to jump to the other proton. This amplitude means 
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Figure 12.7 A schematic representation of the two states 
used as a basis for a variational calculation of the ground 
state of the hydrogen molecule ion. In |1) the electron 
combines with one of the protons to form a hydrogen atom 
in its ground slate, while in |2) it combines with the other 
proton, again forming hydrogen in the ground state. 


e 



Figure 12.8 The coordinates of the two protons and 
the electron used in the discussion of the hydrogen 
molecule ion. 


that the matrix representation of the Hamiltonian 



If - R/2| 


2 

e 


|f + R/2| 



(12.65) 


using the states |1) and |2) as a basis will have off-diagonal matrix elements, similar 
to what we assumed in our treatment of the ammonia molecule. Here, because of 
the relative simplicity of the H~ ion. we can actually calculate the matrix elements: 


#11 = 01 


— --) 0 ) - ( 1 (———- 11 ) + — 

,2m, |f - R/2|y |f + R/2| R 

/ 2 2 

dr’r --- 1 <r 11)| 2 + — 

|r + R/2| R 

where E [ is the ground-state energy of the hydrogen atom: 

= < 2 | ( -) | 2 > - ( 2 | ——— 

" \2m e |r + R/2| / |f — F 

= £,- [ d*r -—-1(r|2>| 2 + - = //„ 

J |r — R/2| R 


(HD 


( 12 . 66 ) 


| 2 ) + —( 2 | 2 ) 
R/2| R 


(12.67) 
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w here the last step follows from the symmetry of the two configurations; 




e 2 e 2 

| 2 ) — ( 1 |—- 12 ) + -( 1 | 2 ) 

|r-R/2| R 


= \E 1 + 

where the amplitude 


d\( l|r) e 


r-R/2| 


<r|2> 


< 1 | 2 ) 


-h 1 


f (l|r><r|2) 


( 12 . 68 ) 


(12.69) 


is called the overlap integral. Note that the nonvanishing of the off-diagonal matrix 
element depends on the states |1) and |2) overlapping in space. Since the wave 
functions are real, the off-diagonal matrix elements are equal: 


2 2 

(2| ———-11) + — <2|1> 

|r + R/2| R 


H n = (2| (¥- - -r-^- - \ |1> - 

V2 m e |f-R/2| ) 

= ( £l + ^)(2|l)-/^(2|r) j ^-, 


(12.70) 


Because H u = H 22 as well as H l2 — H 2 j, the linear combinations of the states |1) 
and |2) that diagonalize the Hamiltonian are 


|±) = 1 =(|1> |2» (12.71) 

v/2±2(l|2) 

where the overall factor is needed to normalize the slates because the basis states 
are not orthogonal (see Problem 12.10). Note from the form of the wave func¬ 
tions (12.64) that the wave function (r|+) has even parity, while the wave function 
(r|—) has odd parity. We could have selected the two states (+) and |—> initially as the 
proper linear combinations from the inversion symmetry of the Hamiltonian (12.65). 
Figure 12.9 shows a sketch of the wave functions, and Fig. 12.10 shows the expec¬ 
tation values of the energies 


£+ = 


1 


l±(l|2> 


(H U ±H V J 


(12.72) 


plotted as afunction of R. Only the even parity' state has a minimum, which occurs for 
a separation of 1.3 A for the protons, corresponding to a binding energy of 1.8 eV. 
Thus the state |+) is referred to as a bonding molecular orbital, while the state 
is called an antibonding molecular orbital. These molecular orbitals are linear 
.' anbinations of atomic orbitals. Note from Fig. 12.9 that only for the bonding orbital 
iv the electron shared between the two protons. For the anti bonding orbital, on the 
other hand, there is a node in the wave function midway between the two protons 
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Bonding 


(b) 


Antibonding 


Figure 12.9 The wave functions for the bonding and antibonding orbital of the hydrogen 
molecule ion plotted (a) along the axis connecting the two protons and (b) in three 
dimensions. 


where the potential energy is quite negative, and thus the electron in this state doesn't 
benefit from the full attraction of the two protons. 

The experimental separation between the protons for the hydrogen molecule ion 
is 1.06 A, with a binding energy of 2.8 eV. The reason for the lack of better agreement 
between our variational results and experiment resides in our choice of trial wave 
function. In particular, notice that as /? —> 0, the system reduces to He + , while 
our trial wave function remains the lx ground state of hydrogen. Thus we should 
not be surprised that we have underestimated the size of the binding energy and 
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R( A) 


Figure 12.10 The energies of the bonding and antibonding orbitals as a function of the 
interproton separation R. 


overestimated the size of the molecule. One way of doing better is to use a trial wave 
function with an effective charge Z (as in Section 12.2) as well as the interproton 
distance R as parameters. Nonetheless, our solution demonstrates the key qualitative 
features of the binding. 

MUON-CATALYZED FUSION 

Note that the interproton separation in the hydrogen molecule ion is on the order 
of the Bohr radius a 0 of the hydrogen atom. This length scale enters in the trial 
wave functions (12.64) that we used in our variational approach to the molecule. 
Since a 0 = h/nca. the size of the molecule depends on the reduced mass p of 
the corresponding atom. In particular, suppose we were to replace the electron 
in the hydrogen molecule ion with a muon. Then the reduced mass is given by 
m n m p/( m n + m p) instead of rn e m p /(m e + m p ). Since the reduced mass for the 
muonic atom is roughly a factor of m p /m e larger than the reduced mass of hydrogen, 
the interproton separation in a muonic hydrogen molecule ion should be a factor 
of m e /m p smaller than for the molecule with an electron generating the binding. 
Replacing the electron with a muon produces a much smaller molecule, just as 
replacing the electron with a muon in the hydrogen atom produces a much smaller 
atom. 

Suppose that the nuclei of this diatomic molecule consist of two deuterons instead 
of two protons. The small size of the muonic molecule means that the deuterons 
have a significantly greater probability of being close together than is the case when 
an electron is responsible for the binding of the molecule. In fact, for this muonic 
molecule nuclear reactions such as 

d + d—*t + p + 4.0 MeV (12.73) 
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have been observed to take place. This is a typical fusion reaction of the sort that is 
contemplated as a plentiful energy source (using the deuterium naturally present in 
sea water). However, w'hereas the attempts to generate fusion commercially depend 
on thermonuclear reactions generated by heating the deuterons to sufficiently high 
temperatures that they have a significant chance of overcoming their Coulomb 
repulsion and being close enough together (on the order of a fermi) to permit the 
reaction (12.73) to occur, the quantum mechanical binding in the muonic molecule 
does this at ordinary' room temperature. Thus this muon catalysis can be described 
as a form of “cold fusion.” Unfortunately, muons themselves are not freely available 
(except in cosmic rays) and thus an accelerator is required to generate the energy 
required to produce muons, for example, in the form of muon-antimuon pairs. The 
only way for reactions such as (12.73) to be a net generator of energy is for each 
muon to form a large number of muonic molecules in which it catalyzes a nuclear 
reaction before the muon decays in roughly 2.2 microseconds. 11 So far this has not 
been feasible, but work is still in progress. 


THE HYDROGEN MOLECULE 

We are now ready to turn our attention to the hydrogen molecule, Hi, where we 
must examine the effect of the identical nature of the two electrons on the binding of 
the molecule. Although we won't spell out the details here, we can use an approach 
similar to the one we used for the hydrogen molecule ion to understand the binding 
of the hydrogen molecule. For Hi, each hydrogen atom in the molecule supplies, 
of course, one electron. For each of the electrons to be in the same region of space 
between the two protons, the spatial state must be symmetric under exchange of 
the two particles, and consequently the total-spin state must be a spin-0 state. With 
two electrons being shared instead of one, the molecule has a binding energy' of 
4.7 eV, as compared with 2.8 eV for H+, despite the extra Coulomb repulsion of the 
electrons. The interproton separation for the molecule is 0.7 A, as compared with 
1,3 A for the ion. Binding occurs only for the total-spin-0 state; the total-spin-1 state, 
corresponding to an antisymmetric spatial state in which the electrons, in general, 
do not reside together in the region between the two protons, does not exhibit a 
minimum in the total energy for any separation of the protons. Thus repulsion rather 
than binding occurs in this case. 

We can now also see why, for example, a hydrogen atom and a helium atom do 
not bind together to form a molecule. The two electrons in the ground state of helium 
are both in the same spatial Is state, and therefore their total-spin state is the singlet 
spin-0 state. We say these two electrons in helium are paired together. The electron in 
the hydrogen atom cannot form a covalent bond and pair up with either one of these 


11 L. Alvarez etal., Phys. Rev. 105. 1127 (1957). For an entertaining account of Luis Alvarez’s 
discovery of muon-catalyzed fusion, see Nobel Lectures — Physics, vol. II, Elsevier, New York, 
1969. 
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electrons, since this would either mean that three electrons are in the same spatial 
state (which is forbidden) or else one of the electrons would have to be excited to 
the 2s state of helium, which is energetically quite costly. On the other hand, as we 
have seen in the hydrogen molecule, if the electron from hydrogen interacts with 
an electron from helium in a total-spin-1 state, repulsion occurs between the two 
atoms. This is the reason helium is an inert element without affinity for other atoms, 
unlike an element with an unpaired outer electron that can pair up with an electron 
from another atom to form a covalent bond. Moreover, once two unpaired electrons 
from different atoms have paired up to form a covalent bond, an electron from a third 
atom cannot pair up with either one of them. Thus we see why the chemical forces 
saturate. 

* 

12.5 Conclusion 


Most of our attention in this chapter has been devoted to systems containing iden¬ 
tical fermions. The requirement that the state of the system be antisymmetric under 
the exchange of any two identical fermions means, in particular, that two identical 
fermions cannot occupy the same state (the Pauli principle). For identical bosons, 
on the other hand, the state of the system must be symmetric under exchange of any 
two of the particles. Consequently, it is possible, and in fact preferred, to have many 
identical bosons in the same state. In Chapter 14 we will see an example when we 
discuss how a laser operates. Other examples in which many bosons condense to the 
ground stale at sufficiently low temperatures include superconductivity and super¬ 
fluidity. 12 Like the laser, these phenomena are interesting and exciting macroscopic 
manifestations of purely quantum behavior. 

Problems 


12.1. Verify for an antisymmetric spatial state under exchange of two particles that 
If a) = \ f d 3 r y d 3 r 2 0 =|r lt r 2> - ^l r 2. r i>) 

x Tz ^ a) ~ r H 

= J </■>, d\ |r„r 2 )<ri, r 2 |\^) 


- This condensation is often referred to as Bose-Einstein condensation, although this term 
should probably be restricted to the condensation to the ground state that takes place when a 
dilute gas of atoms confined in a trap is cooled to temperatures in the microkelvin range. See J. S. 
Tow nsend. Quantum Physics: A Fundamental Approach to Modern Physics, University Science 
Books, Sausalito. CA, 2010, pp. 242-247. 
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12,2. Two identical, noninteracting spin-| particles of mass m are in the one- 
dimensional harmonic oscillator for which the Hamiltonian is 


H = — + -mu> 2 x: + + -nuo 2 x~ 

2m 2 1 2m 2 2 


(a) Determine the ground-state and first-excited-state kets and corresponding 
energies when the two particles are in a total-spin-0 state. What are the lowest 
energy states and corresponding kets for the particles if they are in a total- 
spin-1 state? 

(b) Suppose the two particles interact with a potential energy of interaction 


V'd-H ~x 2 \) - 


~ v o l*i ~x 2 \ <a 
0 elsewhere 


Argue what the effect will be on the energies that you determined in (a), 
that is, whether the energy of each state moves up, moves down, or re¬ 
mains unchanged. Suggestion; Examine which spatial wave functions for the 
total-spin-0 and total-spin-1 states tend to have the particles closer together. 
Consider, for example, the special case of X\ = x 2 . 


12.3. Obtain an order-of-magnitude estimate for the singlet-triplet splitting of the 
energy levels of the two electrons in helium due to a direct spin-spin interaction. 
Suggestion: Compare with the magnitude of the hyperfine interaction in hydrogen 
as discussed in Chapter 5. 


12.4. Use the variational principle to estimate the ground-state energy for the one¬ 
dimensional anharmonic oscillator 

H = — + bx 4 
2 m 

Compare your result with the exact result 

E 0 =,.060^(-) 

12.5. For the delta function potential well 

2m , r , „ X „. , 

V{x) = --8(x) 
h- b 

use a Gaussian wave function as a trial wave function to obtain an upper bound for 
the ground-state energy. Compare with the result of Problem 6.19. 


12.6. Consider the one-dimensional system of a particle of mass m in a uniform 
gravitational field above an impenetrable plane. Take the potential energy to be 
infinite at the plane and locate the plane at z — 0. 
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(a) Plot the potential energy of the particle. What is the Hamiltonian? Sketch 
roughly the ground-state wave function, 
i bi Use an appropriate trial wave function to estimate the ground-state energy, 
c i What is the average position (z> of the particle above the plane? 

12.7. Use as a trial wave function the Gaussian 



to obtain an estimate of the ground-state energy for the linear potential energy 
Vfx) =a\x\. 

12.8. Use the trial wave function R(r) = Ne~ br to estimate the ground-state energy 
of the three-dimensional isotropic harmonic oscillator, for which V(r) = jnco 2 r 2 . 
Compare your estimate with the exact value (see Section 10.5). 

12.9. A muon and a proton are bound together in the ground state of a muonic 
“hydrogen atom.” As this atom diffuses around, it bumps into a deuteron. What is the 
incentive in terms of energy for the muon to jump from the proton to the deuteron, 
forming another muonic atom, but this time with the deuteron instead of the proton 
as the nucleus? Explain. 

12.10. Show that the linear combinations of states that diagonalize the Hamiltonian 
of the hydrogen molecule ion arc given by (12.71). Verify that these states are 
properly normalized and that the corresponding energy expectation values are given 
by (12.72). Note: If | \j/) is not normalized, then ( E) = (\j/\H\\J/)/(\j/\i//). 
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Scattering 


How do wc learn about the nature of the fundamental interactions on a microscopic 
level? Solving the Schrodinger equation for the hydrogen atom yields an infinite set 
of energy levels that gives us proof that the Coulomb interaction is the predominant 
interaction between an electron and a proton on a distance scale on the order of 
angstroms. However, for the two-body problem in nuclear physics, as we discussed 
in Chapter 10, there is a single bound state, so we must resort to scattering techniques 
to learn about the nature of the nuclear force. After all, it was through a scattering 
experiment that Rutherford first discovered the very existence of the nucleus within 
the atom. Subsequently, scattering has played a major role in helping us learn 
about nuclear physics and particle physics, as well as more about atomic physics. 
After introducing the concept of the scattering cross section, we will use the Born 
approximation and the partial wave expansion to calculate the cross section in 
quantum mechanics. These two approaches are in a sense complementary to each 
other: the Bom approximation works best at high energies and the partial wave 
expansion has its greatest utility at low energies. 

13.1 The Asymptotic Wave Function and the 
Differential Cross Section 


In a typical scattering experiment, a beam of particles, often from an accelerator, is 
projected at a fixed target composed of other particles. In Rutherford’s experiment, 
the incident particles were or particles, 4 He nuclei, that were emitted in radioactive 
decay, while the target consisted of gold atoms in the form of a thin gold foil. A 
schematic diagram of this experiment is given in Fig. 13.1. The angular distribution 
of the scattered a particles provided clear evidence of the existence of a relatively 
massive gold nucleus. More recently, experiments done at the SLAC National Ac¬ 
celerator Laboratory with high-energy electrons accelerated in the two-mile-long 
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Figure 13.1 A schematic diagram of the Rutherford scattering experiment. 


accelerator to 20 GeV as the incident projectiles and target protons in the form of liq¬ 
uid hydrogen revealed that protons were actually composed of fractionally charged 
constituents, which are called quarks. A convenient way to describe such experi¬ 
ments is to picture the incident particle, initially far from the target, at least on a 
microscopic scale, as an essentially free particle that interacts with the target only 
when it is within the range of the potential energy of interaction V(r), which we 
will take to be spherically symmetric. Just as a comet can be deflected by its gravita¬ 
tional interaction with the Sun, so loo can the incident projectiles in these scattering 
experiments be deflected by their interaction with the target particle. Since the par¬ 
ticles interacting through this potential energy V (r) are not bound, the energy of the 
incident particle can take on a continuum of different values, just as it did when we 
analyzed the free particle in one dimension in Section 6.6. As we saw there, physical 
states in the form of a wave packet can be formed from the superposition of these 
continuum states. 

In practice, this wave packet generally has a sharp peak in momentum space about 
some incident momentum p 0 , and consequently the wave packet in position space 
is quite broad. In fact, we will assume that it is sufficiently broad that we can treat it 
as a plane wave for the purposes of analyzing the experiment. This is generally the 
way one-dimensional scattering is discussed w'ithin wave mechanics. Specifically, 
as we discussed in Section 6.10, one considers a potential energy function such as 
the potential barrier shown in Fig. 13.2. Outside the range a of the potential, we can 
express the wave function as a plane wave: 


fix) = 


Ae ikx + Be~ ikx x < 0 
Ce ikx x > a 


03.1) 


where k = f2mE/ti 2 . The time dependence of such an energy eigenfunction is the 
usual This is of course a stationary state. However, by superposing these 

energy eigenstates together, we can produce a wave packet w'ith time dependence 
such that the incident wave packet alone approaches the barrier, and after interaction 
with the banner there is an amplitude for the wave packet to be reflected and an 
amplitude for it to be transmitted, as depicted in Fig. 6.12. Examining the stationary 
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Figure 13.2 A potential barrier of width a in a one¬ 
dimensional scattering experiment. 


states to determine these amplitudes is analogous to doing a scattering experiment 
with water waves in a pond in which the source of the waves is not a single stone 
thrown into the pond (which would generate a wave packet) but a harmonic source 
that continually beats up and down in the water at a steady frequency. 

What is the analogue of (13.1) in three dimensions? We take the incident wave 
to be traveling along the z axis, with the target located at the origin. Far from the 
target the asymptotic wave function should include this incident wave together with 
an outgoing wave produced by interaction with the potential: 

)//-» Ae lkz 4- (outgoing wave) (13.2) 

r—►oc 

(see Fig. 13.3). Outside the range of the potential, where the particle detectors are 
located, the outgoing spherical wave must be a solution to the Schrodinger equation 



Figure 13.3 A schematic diagram of a three-dimensional 
scattering experiment indicating the incident plane 
wave and the outgoing spherical wave. 
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w ith V = 0. Thus, we start with the differential equation (10.4) for the radial wave 
function u = rR with V = 0, 


f'r d~u /(/ + l)h 2 

-— 3-- —u = Eu 

2n dr 2 2fir 2 


(13.3) 


For large r we can neglect the centrifugal barrier term and obtain, for any /, the 
equation 


h 2 d 2 u 

'2^dT 2 


= Eu 


which has the two solutions 

u = e lkr and u = e~' kr 


(13.4) 


(13.5) 


with 


* = 



(13.6) 


If we attach the time dependence e~ lEl ^‘ to each of these solutions, we see that only 
e ,kr corresponds to the outgoing wave, since as time increases, r must also increase 
to keep the phase constant. This outgoing wave is the type that we expect to be 
generated by the interaction of the incident wave with the potential. Since R = u/r, 
this suggests that we express the asymptotic wave function (13.2) in the form 

* -> Ae ikz + Af(&, 0)—= lAinc + V'sc (13-7) 

oc r 

where the function f(6, 0) allows for angular dependence in the outgoing scattered 
wave. After all. we have no reason to expect that the outgoing scattered wave 
should have the same amplitude in the forward direction (0 = 0) as at other angles. 
Also, the spherical harmonics that appear in the energy eigenfunctions (r|F, I, in) — 
R{r)Yj m (6, 0) can add up to produce, in principle, any angular dependence. 

How do we relate this asymptotic wave function (13.7) to what is measured in 
the laboratory? Let’s return to the way the experiments are actually performed. As 
in Section 6.10, the incident flux of particles can be related to a probability current, 
whose form follows from the Schrodinger equation in position space, 

- — V 2 x/s + Vxl/ = ih— (13.8) 

2 ix 3 1 

We start with the probability density 

l(r|0)| 2 = 0'*(r)0(r) (13.9) 
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By evaluating the time derivative of this probability density and taking advantage of 
the form of the Schrodinger equation (13.8). we find that we can write 

|-(*» + V.j = 0 (13.10) 

3/ 

where 


f- 

j=—W) (13.11) 

2 iii 

The dimensions of the probability current j are probability per unit area per unit 
time. Equation (13.10) expresses conservation of probability in the form of a local 
conservation law, since it implies that 

~ j d y r \jr > = - J rf 3 rV.j = -|(/5nj (13.12) 

where in the last step we have used Gauss’s theorem to convert the volume integral 
to a closed surface integral over the surface enclosing the volume. Note that if 
the integral over the surface of the dot product of the probability current j with 
outward normal n to the surface is positive, there is a net outflow of probability from 
the volume, and consequently the probability of finding the particle in the volume 
decreases. 

In Section 6.10 we saw that the one-dimensional probability current for the wave 
function (13.1) is given by 


j = -(\A\ 2 -\B\ 2 ) x<0 
m 

hk 1 

j = —|Cj ,v > a 
m 


Thus the reflection coefficient can be calculated from 

R _j K{ _ (hk/m)\B\- ^\B\ 2 
jinc (hk/m)\A\ 2 |*|2 

while the transmission coefficient is given by 


■Aran _ (M/;»)|C | 2 _ JCp 
jiac ( hk/m)\A\ 2 \A\ 2 


(13.13a) 

(13.13b) 


(13.14) 


(13.15) 


In one dimension, conservation of probability requires that R 4- T = 1. 

In three dimensions, there is more than just reflection and transmission. The 
experimentalist counts the number of particles scattered through angles 9 and <p 
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Figure 13.4 An experimental setup characteristic of a three-dimensional scattering 
experiment. The detector subtends a solid angle dQ. 


that enter a detector that subtends a certain solid angle, as indicated in Fig. 13.4. In 
particular, the differential cross section da is defined by 

, da number of particles scattered into dQ per second , „ , „ 

da = —dQ =---e-- (13.16) 

dQ number of particles incident per unit area per second 

Note that the dimension of the cross section is area. We can think of it as an effective 
area that the target presents to the incident flux of particles. Also note from (13.16) 
that if we multiply this incident flux, which is the number of incident particles per 
unit area per unit time, by the differential cross sectional area da, we obtain the 
number of particles scattered into the solid angle dQ per unit lime. The total cross 
section a is obtained by integrating over all angles: 



This total area may have a simple physical significance in some cases. For example, 
in a nuclear scattering experiment with neutrons as the projectiles and a nucleus as 
the target, the total cross section is on the order of the size of the nucleus, since 
the nuclear force is short range and neutrons that strike the nucleus are likely to 
interact with it. On the other hand, if neutrinos are the projectiles and the target is a 
nucleus, the cross section is many orders of magnitude smaller. This is not because 
the nucleus has suddenly shrunk in size but because neutrinos interact so weakly 
with the nucleus that most of the neutrinos pass right through the nucleus without 
scattering at all. 

The flux of particles in a scattering experiment is proportional to the probability 
current. If we use the asymptotic wave function (13.7) to calculate the probability 
current, we find that the current does not simply break up into two pieces consisting 
of an incident current and a scattered current. Unlike the one-dimensional case 
(13.13), there is interference between the incident wave and the scattered wave. This 
interference in the forward direction (6 — 0) is, in fact, responsible for the reduction 
of flux in the forward direction from its incident value and therefore is necessary 
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Figure 13.5 If the incident “plane” wave extends over a distance b transverse to the z axis, 
a distant detector located a distance r from the origin and at an angle 9 > b/r will nol 
be in the path of the incident beam. As r —*■ oo, such a detector would only detect the 
incident beam as 9 —> 0. 


for scattering to occur at all. 1 Nonetheless, in practice it is possible to calculate 
probability flux entering a detector that is not directly in the forward direction by 
using the scattered outgoing wave function alone. In the actual experiments, the 
incident beam is limited in the transverse direction by the sides of the beam tube, if 
by nothing else. If we call this transverse dimension b, then at a distance r from the 
target the incident wave is present only for angles 6 ^ b/r (see Fig. 13.5). Since the 
particle detectors are located at large distances from the target (r -> oo), the scattered 
wave will be the only wave of interest unless the detector is placed essentially at 
0 = 0, namely, in the incident beam. 

To determine the probability current that flows into a detector that subtends a 
solid angle dQ, we calculate 

jsc = (13.18) 

where the asymptotic form of the scattered wave function is given by 

- >Af(0,(t>)— (13.19) 

r-xoc r 

The scattered probability current is then given by (see Problem 13.2) 

frk 

jsc-► — |A| 2 |/I 2 u r (13.20) 

r-»oo /X/— 

where the unit vector u r show's that this current is radially directed. The probability 
flow' (probability per unit time) into a solid angle dQ. is determined by taking the dot 
product of the scattered probability current (probability per unit area per unit time) 


1 See also the optical theorem in Section 13.4. 
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■ t'n the area u ,.r 2 dQ covered by an infinitesimal detector: 

(13.21) 

Since the incident probability flux, which is directed along the z axis, is given by 

(13.22) 


Jr 1 

j«c* V 2 cM = —\A\ 2 \f\ 2 dQ. 


iinc Ml 


the differential cross section that follows from the definition (13.16) is 

^ rf n = kJ*rd*L l/l 2 ^ 

dtl jinc 


(13.23) 


Thus 


— = \f(0,<P)\ 2 (13.24) 

£ZS2 

It is this differential cross section that replaces the reflection coefficient R used 
to describe one-dimensional scattering. The function /(#, <p ) is referred to as the 

scattering amplitude. 


13.2 The Born Approximation 


A particularly good approach for calculating the scattering amplitude when the 
energy of the incident beam is large in comparison with the magnitude of the potential 
energy is the Born approximation. We begin by expressing the position-space energy 
eigenvalue equation 


V 2fi 


V z + V) f(r) = E^(r) 


in the form 


(V 2 + k 2 )ij/(r) = ~V (r)f(r) 
}>/ 


2fi. 


(13.25) 


(13.26) 


with k given by (13.6). The incoming plane wave Ae' k: is a solution to the equation 

(V 2 + k 2 )yff(r) = 0 (13.27) 

It is convenient to express the formal solution to (13.26) in the form 

if(r) = Ae ikz + J clV G( r, (13.28) 
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The function G(r, r') is called the Green’s function of the differential equation 
(13.26) and itself satisfies the differential equation 


(V 2 + k 2 )G(r, r') = <$ 3 (r - r') (13.29) 

That (13.28) is a solution to (13.26) can be verified by applying the differential 
operator V 2 + k 2 to (13.28). It is only a formal solution since the wave function 
t/r appears on the right-hand side of this equation as well as on the left-hand side. 
Nonetheless, we will see that (13.28) provides a useful route for determining the 
wave function in an iterative procedure known as the Bom approximation. 

We first determine the Green’s function, making sure that the solution (13.28) 
satisfies the appropriate boundary conditions. As indicated by the argument of the 
delta function, the Green’s function itself depends only on the difference of the 
vectors r and r'. With this in mind, let’s first set r' to zero in (13.29) and determine 
the solution to the equation 


(V 2 + k 2 )G(r, 0) = <5 3 (r) (13.30) 

Given the spherical symmetry of this differential equation, we naturally search for 
a solution in spherical coordinates. Notice that, except at the origin, the Green’s 
function is a solution to (13.27). We saw in Section 13.1 that a solution to the 
Schrodinger equation for a free particle in the form of an outgoing wave is given by 

G(r, 0) = C -— (13.31) 

r 

where C is some constant. In fact, (13.31) is actually a solution to (13.30) even at 
the origin, provided we choose the value of C properly. Recall from (10.9) that 

V 2 - = -4jr5 3 (r) (13.32) 


Note that 



r 


v ■ ^ V— j = V • ( \e ikr )- + V • ( V- j e tkr 

-V 2 e ikr + 2Ve ikr ■ ( V-) + e'^V 2 - 
r \ r) r 



4n8 3 (r)e ikr 



r 


4n8 3 (r)e ikr 


(13.33) 
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Detector 


Figure 13.6 The r' integration is restricted to lie 
within the range of the potential energy, which is 
indicated by the shaded region, while the particle 
is detected at r. 


where the evaluation of the action of V 2 on e ,kr 


* 

has been carried out using 


d 


2 a 


V = —- H-F angular derivatives 

dr 2 r dr 


(13.34) 


Thus 


gikr 

(V 2 + k 2 )C — = —477-C<5 3 (r) 
r 


Comparing this result with (13.30), we see that 


G(r. 0) = - 


Jkr 


4 7i r 


Therefore 


G(r. r') = - 


gik Ir-r'l 
47r|r — r'| 


(13.35) 


(13.36) 


(13.37) 


and 


Vr(r) = Ae lkz 




Inti 




3 ' 


d*r 


e ik Ir-r'l 

---V(r')^(r') 

|r - r | 


(13.38) 


Generally, the range of the r' integral is limited by the range of the potential 
energy V (r') to a microscopic distance (see Fig. 13.6). In order to determine the 
scattering amplitude, we need to examine the behavior of the wave function as 
r oc. In this case, since r » |r'|, we can approximate 


— r'| = ( r 2 - 2r • r' + r' 2 ) 1,2 = r ^1 -2u r • — + r — j — —* r ^1 - u r ■ — ^ 


(13.39) 
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where we have neglected terms of order ( r'/r) 2 . Thus in (13.38) we can make the 
replacement 



(13.40) 


since the terms that we are neglecting make a vanishing contribution to the integral 
relative to the one that we have retained as r -* oo. However, within the exponent 
all that matters is the value of the phase kr modulo 2n, no matter how large r is. 
Thus we need to retain both the first two terms in the expansion (13.39), namely, 


e f«jr-r'| _^ e ikr{\-a r -t'lr) _ j ,/*r < ,-/k r T (13.41) 

r—*00 

where ■* 


ky = ku r (13.42) 

points in the direction of the outgoing scattered wave. Again, the terms of order 
(r'/r) 2 can be safely neglected relative to the two terms that we have retained. With 
these approximations, (13.38) in the asymptotic limit becomes 

Vr(r)-» Ae ikz - f d 3 / e~ ik f r 'v (r')^(r') (13.43) 

r-*oo Tjlfrr J 

In retrospect, we can now see why we made the choices we did in deriving (13.43). 
Clearly, we chose the particular solution Ae' kz to (13.27) to match up with the bound¬ 
ary condition that our wave function include the correct incident wave. Similarly, in 
deriving the Green’s function (13.37), we discarded the incoming spherical wave 
Ce~ ,kr /r because of the physical requirement that the potential generate only out¬ 
going waves. In practice it may be possible to do scattering with incoming spherical 
waves, 2 but most experiments are similar to the approach described earlier in which 
an incoming plane wave generates an outgoing spherical wave upon interaction with 
the target. 

We are now ready for the Bom approximation. As we remarked earlier. (13.43) is 
an integral equation that involves the wave function i j/ on the right-hand side within 
the integral, as well as on the left-hand side. If the potential energy V were set to 
zero, the solution for \J/ would be simply \j/ — Ae ,kz . This suggests that if the mag¬ 
nitude of the potential energy is small compared with the energy E. we can replace 
the wave function xjr (r') within the integral with that of the incident wave. Then 

1r(r) -» Ae ikz -— f d \< e - ik f r 'V(r')Ae ikz ' (13.44) 

r->oo 2nh 2 r J 


2 An example might be using laser light to implode a pellet of deuterium in an attempt to 
generate thermonuclear fusion reactions. 
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Comparing this equation with general asymptotic expression (13.7) reveals that the 
'cattering amplitude is given in the Bom approximation by 

/(0,0) = -- ffr- \ rf 3 rV-' k /- r V(iV Az ' 

=- f d 3 r'V( r , )e' q,r ' (13.45) 

2nh 2 J 

where we have introduced the vector k, with magnitude k directed along the z axis, 
the direction of the incident wave. Note that the vector hq — ftk,- — hk t = p, — p , 
is just the momentum transferred from the incident beam to the target during the 
scattering process and that the scattering amplitude is, up to an overall constant, 
just the Fourier transform of the potential energy with respect to q. This Born 
approximation can be considered as the first in a series of approximations arising 
from an iterative procedure in which the wave function determined by the previous 
iteration, such as (13.44), is then substituted for the exact wave function on the 
right-hand side of (13.43). 

A rough estimate of the range of validity of the Born approximation can be made 
by noting that since we replace i/rfr') by i/r inc in (13.43), we want l^sc/^ind 1 
within the range of the potential (where V / 0), that is, in the vicinity of the origin. 
Comparing (13.7) with (13.38), we see that 


u r , «'*lr-r'l 

tM r ) = -~ dV e - --F(rW) 03.46) 

2nhr J r - r'| 


Thus since i/q nc (0) — A, 


t/r sc (0) 


M [ 

’Ainc(O) 


2n h 1 J 


Jkr ' 


(13.47) 


where we have replaced the exact wave function on the right-hand side of (13.46) 
with the incident wave function in accord w'ith the Bom approximation. If the 
potential energy is spherically symmetric, V(r') — V(r'), we can carry out the 
angular integrals, and the condition for the validity of the Bom approximation 
becomes 


^sc(O) 



$ inc(°) 


2jih 2 Jo 


r dr 1 f 2n d<f>' [ X d& sin Q' r'e ikr, V(r')e ikrlcme ' 

Jo Jo Jo 


[ dr e lkr V (r') sin kr' 

fi 2 k Jo 


«1 


(13.48) 
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At high energies (k oo), the exponential and the sine in (13.48) oscillate rapidly 
and cut off the integral for r' ^ 1 jk. The condition (13.48) becomes in this case 

- 4 - / dr'V{r')kr’ 

H 2 k Jo 

or 

^ « 1 (13.49b) 

E 

as we argued earlier. The Born approximation may also be valid at low energies, but 
under more restrictive conditions (see Problem 13.3). 

13.3 An Example of the Born Approximation: 

The Yukawa Potential 


d^o f l/k 
fi 2 k Jo 


dr' kr 


H 2 k 2 


«1 (13.49a) 


Let’s evaluate the scattering amplitude for the potential energy 

e ~"‘0 r 

V(r) = g - (13.50) 

r 

known as the Yukawa potential. With the appropriate choice for the values of g 
and m 0 , this potential could represent the short-range potential energy between two 
nucleons, say a neutron and a proton. Or if we choose g = Z 1 Z 2 e 2 and m 0 = 0, 
the potential reduces to the Coulomb potential energy between a projectile with 
charge Z,e and a nucleus with charge Z 2 e, as in Rutherford scattering. However, the 
Coulomb potential does not vanish fast enough for large r to ensure that (13.7) is an 
asymptotic solution to the Schrodinger equation in this case, and thus our formalism 
is not appropriate for a pure Coulomb interaction. Nonetheless, we can consider 
the factor e~ m ° r as a mathematically convenient w'ay to introduce the screening that 
actually occurs in Rutherford scattering, where the electrons within the atom shield 
the incident a particle from the nucleus until the a particle penetrates the electron 
cloud. Recall that the size of the atom is on the order of an angstrom, while the size 
of the nucleus is between 10 4 and 10 5 times smaller. 

From (13.45) 

/ (9, 4>) = — [ d\' e -—e‘^ (13.51) 

Inn 1 J r 

In order to carry out the integrals, it is convenient to choose our dummy integration 
variables so that the z axis is parallel to q. Then q ■ r' = qz' -- qr' cos 9', w-here O' 
is the usual polar angle in spherical coordinates. Thus 

/ = -- dr' I 2 * d()>' ^ d6 r sin 9' r'e~ m ° r 'e iqr ' caid ' (13.52) 
2jrfi 2 Jo Jo Jo 
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Figure 13.7 The incident wave vector k,, the scattered wave 
vector k t . and the vector q = k, — k f . 


Carrying out the angular integrals first, wc obtain 


• /»00 

/ = Mi / dr' e- m ° r '(e iqr ' -e~ iqr ') 
fi 2 q Jo 

_ ~2m/? * 

ft 2 (m 2 0 + q 2 ) 

Note that 


<? 2 = (k, - ky) 2 

= (k 2 - 2k, • k f + k 2 ) 

= 2 k 2 ( 1 - cos 6) = 4 k 2 sin 2 - 

2 

and therefore f = f{9) only. The angle 6 is shown in Fig. 13.7. Thus 

£*1 = | fW) \2 = 4 ^ 2 g 2 

dQ /Hfwijj + 4k 2 sin 2 (0/2)P 


(13.53) 


(13.54) 


(13.55) 


The differential cross section depends only on the angle 9 and not on the angle 
<p because of azimuthal symmetry under rotations about the z axis for a spherical 
potential. 

Wc now specialize to the case of Coulomb scattering. At energies and/or an¬ 
gles such that 4 k 2 sin 2 ((9/2) w 2 , the expression (13.55) for the differential cross 

section reduces to 


da_ _ jJL 2 (Z { Z 2 e 2 ) 2 
da ~ hHk 4 sin 4 (0/2) 

(Z^e 2 ) 2 

16£ 2 sin 4 (0/2) 


(13.56) 


the famous result for Rutherford scattering. Interestingly, the dependence on Planck's 
_ ’iistant has disappeared entirely. This differential cross section (13.56) is in com- 
r etc agreement with that obtained from a classical analysis of Coulomb scattering, 
a* a ell as with that obtained from an exact solution using quantum mechanics. 
W uhi ut this rather fortunate agreement between the classical and quantum results. 
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the historical development of quantum mechanics would almost certainly have been 
quite different. It was the agreement between (13.56) and experiment that led Ruther¬ 
ford, before the advent of quantum mechanics, to the nuclear model of the atom. In 
classical physics, this model could not be stable as the electrons in their classical 
orbits radiated away their energy and spiraled into the nucleus. Bohr first addressed 
these problems with his own model, with its stationary states and discrete energies, 
in the crucial transition between classical and quantum mechanics. 

13.4 The Partial Wave Expansion 


In general, the Born approximation is a high-energy approximation for calculating 
the differential cross section. There is another approach, known as the partial wave 
expansion, that is most useful at low energies and is therefore somewhat comple¬ 
mentary to the Bom approximation. 

As we have seen in Rutherford scattering, if the potential energy is spherically 
symmetric, the scattering amplitude is a function of 9 only: 

/(0.tf) = /(0) 03.57) 


We begin by writing 

OC 

f(0) = £(2/ + l)a/(£)P,(cos 9) (13.58) 

/=0 

as a superposition of the partial waves, where 

p,(cos0) = V2^T y; ’° (13-59) 

is a Legendre polynomial (see Problem 9.17). In a sense, all we are saying in (13.58) 
is that we can write any function of 9 as a superposition of Legendre polynomials. 
After all, these functions form a complete set. The rationale for expressing the 
coefficients in the expansion in the form (13.58) will become clear’ shortly. For now, 
let us note that the value of the coefficient cij(k) will, in general, depend on the value 
of the energy, a dependence exhibited explicitly by indicating that the partial wave 
is a function of k. 

Although in principle we have traded f{9) for an infinite set of partial waves, 
the utility of (13.58) comes from the fact that at low energies only a few of the 
afk) are significantly different from zero. To see why, we first give a heuristic 
semiclassical argument. In general, a beam of particles incident on a target in a 
scattering experiment with a broad spectrum of impact parameters consists of many 
different orbital angular momenta, as can be seen by evaluating r x p for each impact 
parameter (see Fig. 13.8). However, if the potential energy has a finite range a, 
scattering will occur only for those impact parameters that are less than a. Thus for 


Page 482 (metric system) 




466 13. Scattering 



Figure 13.8 A classical representation of the incident flux in terms of particles following 
ell-defined trajectories. Those particles with impact parameter b possess orbital angular 
momentum |L| = |r x p| = bfik. Only particles with impact parameters less than or equal 
to the range a of the potential energy would interact with the target. 

■» 


interaction there is a maximum angular momentum, whose value is roughly given 
by hlmscx — a P = or / max = ak. The lower the energy and the smaller the value 

of k , the fewer angular momentum states can interact with the target. 

To start our partial wave analysis in quantum mechanics, we express the incident 
plane wave in the form (see Problem 13.8) 

00 

g/fc cos (9 = J2i‘(.2l + l)j,(kr)P,(cos9) (13.60) 

/=o 

which can be considered as a special case of the more general expansion 

VKr) = c,R,(r)Y L0 (e) (13.61) 

i 

Only the Y t 0 ’s enter because the plane wave (13.60) is independent of </>. Since 
the plane wave is the wave function of a free particle, the radial functions must be 
spherical Bessel functions. We must discard the spherical Neumann functions in 
the expansion because they blow up at the origin, as we discussed in Chapter 10. 
Clearly, the plane wave is finite at the origin. Note that the appearance of all angular 
momenta in this expansion is consistent with a picture of a plane wave, which is 
infinite in extent, as having all impact parameters. 

What can we say about the asymptotic form of the full wave function (13.7), 
ncluding the scattered wave, in terms of partial waves? Since we are searching for 
a solution of the Schrodinger equation for r —* oc, a region in which V = 0 but 
ne that excludes the origin, we must include both spherical Bessel and Neumann 
functions in our general expression for the function /?,(r) in (13.61): 

t/rtr)-* ji(k r ) + B i rj^kr^P^casO) (13.62) 

r-*o o 
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The asymptotic expressions for the Bessel functions themselves are given by 


. „ , sin (kr - ln/2) 

j,(kr) ->--- 

r-^oo kr 


r)i(kr )-> 

r-+ oo 


cos (kr — ln/2) 
kr 


(13.63) 


Substituting these forms into (13.60) and (13.62), we obtain 

sin(/:r — ln/2) 


Jkz 


J2i l (2i + \y- 


1=0 


kr 


P/icosd) 


(13.64) 


for the incident wave and 
tfr(r)--* J2 


sin (kr — ln/2) cos (kr — ln/2) 
A i -;-w _ 


kr 


kr 


P,( cos 9) 


=£c 


1=0 


sin[&/- — ln/2 + Sj(k) 1 
kr 


Pj (cos 9) 


(13.65) 


for the complete wave function, where in the last step we have combined the sine 
and cosine into a sine function with its phase shifted by S t (k). Comparing (13.64) 
and (13.65), we see that the effect of the potential is to introduce a phase shift in the 
asymptotic wave function. Figure 13.9 shows qualitatively how this happens. 

We can express (13.65) in the form 

“ J(kr-ln/2+S,) _ „—i(kr—hr/2+5i) 

yr —-* V C, ——-—-— P,( cos 9) (13.66) 

r—oc “ 2 ikr 

which contains both incoming and outgoing spherical waves. What is the source of 
these incoming spherical waves? They must be due to the presence of the incident 


u u 




Figure 13.9 A depiction of how the potential energy affects the phase of a wave, (a) A 
potential well (an attractive potential) produces a positive phase shift (5 0 > 0) for the 
radial function u = rR while in (b) a potential barrier (a repulsive potential) generates a 
negative phase shift (6 0 < 0). The dashed curve shows u when V = 0 in each case. 
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plane wave in the full asymptotic wave function. If we rewrite (13.64) as 


Jkz 


Mkr-ht/2) _ -i(kr-h r/2) 

2_V(2/ + 1)-—-P,(c os 6>) (13.67) 


1=0 


2 ikr 


we see these incoming spherical waves explicitly. Since-' 


\{r — e‘ 


ikz 


Jkr 

—» m— 

r-*- oo r 


(13.68) 


which is an outgoing spherical wave only, the incoming spherical waves must cancel 
if we subtract (13.67) from (13.66). which implies that 


C, = (2/ + l)e ilir/2 e iSl 


(13.69) 


With this result, we see that 


00 , 

fm = £(2/ + 1)—(e^ - l)/ 5 ,(cost?) 


1=0 

00 




= ^(2/ + 1)-— sin 5, P, (cos 6) 


(13.70) 


1=0 


Thus, comparing (13.58) and (13.70), we find 

e iS i 

cii(k) = — sin 5, 
k 


(13.71) 


Determining the scattering amplitude through a decomposition into partial waves is 
equivalent to determining the phase shift for each of these partial waves. 

In order to determine the total cross section 


o- = / dQ — = f d£l 

J dQ J 


\/m 2 


(13.72) 


we take advantage of (13.59) and the orthogonality of the spherical harmonics, 
namely, 

j dQ Y* m (6, 0)K rv (0, 4>) = (13.73) 


to do the integral over the solid angle: 


a = — ^(2/ + 1) sin 2 5, 


00 


k 2 


(13.74) 


1=0 


In this section we have set the amplitude A of the incident plane wave equal to unity for 
mathematical simplicity'. Note that the differential cross section (13.24) is independent of A. 
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Comparing this result with the expression (13.70) for the scattering amplitude, we 
see that 

o = -plm / (0) (13.75) 

k 

where we have taken advantage of the fact that the Legendre polynomials satisfy 
P)( 1) = 1. Equation (13.75) is known as the optical theorem and is a reflection of the 
fact that, as we discussed earlier, the very existence of scattering requires scattering 
in the forward direction in order to interfere with the incident wave and reduce the 
probability current in the forward direction. 

Finally, it is common to express (13.74) as 

00 

cr = ^oj * (13.76) 

1=0 

where 

a, = ^(2/ + 1) sin 2 8, (13.77) 

the /th partial cross section, is the contribution to the total cross section by the /th 
partial wave. Note that the maximum value for the /th partial cross section occurs 
when the phase shift <5/ = n/2. 

13.5 Examples of Phase-Shift Analysis 


HARD-SPHERE SCATTERING 

We first analyze the scattering from the repulsive potential 


V(r) = 


oo r < a 
0 r > a 


(13.78) 


which characterizes a very hard (impenetrable) sphere. Our earlier discussion sug¬ 
gests that at sufficiently low energy the / = 0 partial wave dominate* the expansion 
(13.70). Determining the phase shift is particularly easy for S-wave scattering, since 
when / = 0 the radial equation simplifies considerably with the elimination of the 
centrifugal barrier/(/ 4- l)/i 2 /2/zr 2 . Outside the sphere, the function u = rR satisfies 
the free-particle equation 


_ Eu 

2/z dr 2 


r > a 


(13.79) 


Rather than write the solution to (13.79) in the form u = R cos kr + C sin kr (or 
in the form u — Be ikr + Ce~‘ k '), we are guided by asymptotic form for the radial 
wave function in (13.65) to write 


u = C sin (kr + S 0 ) r > a 


(13.80) 
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u 



Figure 13.10 A plot of the wave function u = rR for S- 
wave scattering front a hard sphere showing the phase shift 
£ 0 = —ka. The dashed curve shows u when V = 9. 


where, as usual, k — yjliiE/h 2 . Figure 13.10 shows a plot of u. The boundary 
condition u(a) = 0 determines the S-wave phase shift: 

C sin(ka + <$ 0 ) = 0 or S 0 = —ka (13.81) 


Thus the S-wave total cross section is given by 


47r . 2 An . 2 
<*l =0 = sm <5 q — sin ka 


(13.82) 


Problem 13.10 shows that the higher partial waves, such as the P-wave, can be 
neglected relative to the S-wave for hard-sphere scattering at low energy. Thus 

An i o 

a -► cr /=0 -> —- (ka) 2 — Ana 2 (13.83) 

0 ku-* 0 k~ 


Notice that the cross section is indeed an area, but in this case the area is four 
times the classical cross section na 2 that the sphere presents in the form of a disk that 
blocks the incident plane wave. Of course, low-energy scattering corresponds to a 
very long wavelength for the incident wave, and thus we should not expect to obtain 
the classical result. However, even at high energies and very short wavelengths we 
cannot completely avoid diffraction effects. Wc give a heuristic argument. At high 
energies, many partial waves, up to Z max = ka, should contribute to the scattering. 
Therefore 


<7 - 

<•<!» 1 



(13.84) 
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With so many / values contributing, we assume we can replace sitr S t by its average 
value, in the sum. 4 Then 


a ->y^(2Z + l)- *■ liter (13.85) 

k«»\ “ k 1 *a» I 

Why do we obtain twice the classical result, even at high energy? For hard-sphere 
scattering, partial waves with impact parameters less than a must be deflected. How¬ 
ever, in order to produce a “shadow” behind the sphere, there must be scattering in 
the forward direction (recall the optical theorem) to produce destructive interference 
with the incident plane wave. In fact, the interference is not completely destructive 
and the shadow has a bright spot (known in optics as Poisson's bright spot 5 ) in the 
forward direction. 


S-WAVE SCATTERING FOR THE FINITE POTENTIAL WELL 

As another example of the determination of the phase shift at low energy, w'e examine 
scattering from an attractive potential, namely, the spherical well 


V(r) = 


— V 0 r < a 
0 r > a 


(13.86) 


In terms of the function u = rR. the energy eigenvalue equation becomes 


h 2 d 2 u 

“ 51 ^“ 0 “ 

fi 2 d 2 u 

-- — Eu r > a 

2/r dr 2 


Equation (13.87a) can be written as 


= *u = y|r (£ + v °} r<a 


(13.87a) 

(13.87b) 

(13.88a) 


4 For a detailed analysis, see J. J. Sakurai and J. Napolitano, Modem Quantum Mechanics , 
2 nd edition, Addison-Wesley, San Francisco, CA, 2011, p. 421. 

5 Ironically, Poisson, who supported a corpuscular theory for light, refused to believe Fresnel’s 
prediction that a bright spot would occur in the shadow of an illuminated disk, until it was 
experimentally verified. 
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' hilc equation (13.87b) is the usual 

^- = -k 2 u k = J 2 ~E r > a (13.88b) 

dr 2 V f) 2 

T he solution to (13.88a) that satisfies the boundary condition «(0) = 0 is 

u = A sin k^r r < a (13.89a) 

As before, we write the solution outside the well in the form 

u — C sin (kr + <$ 0 ) r > a (13.89b) 

allowing explicitly for the appearance of a phase shift. The finite spherical well is 
especially nice because we can determine analytically the wave function everywhere, 
both inside and outside the well. 6 

Making sure that u is continuous and has a continuous first derivative at r = a, 
we obtain 


A sin k^a — C sin (ka + <S 0 ) (13.90a) 

Ak 0 cos k 0 a — Ck cos (ka + <5 0 ) (13.90b) 


Dividing these two equations, we obtain 


tanfktf + So) = — tan fcja — 


ka 

— tan k 0 a 
k 0 a 


(13.91) 


This equation for the S-wave phase shift simplifies considerably at sufficiently low 
energy, that is, ka —>■ 0. Since ka/k 0 a <<C 1, as long as tan k 0 a is not too large, the 
right-hand side of (13.91) is much less than one and we can replace the tangent of a 
small quantity w'ith the quantity itself. Thus 


ka + S 


_ ka , 

0=1— tan k o a 
k 0 a 


(13.92a) 


or 


So = ka 


( tan k 0 a 
koa 



(13.92b) 


6 For other potential energies for which an analytic solution is not so easy to determine, we 
can still solve the energy eigenvalue equation by integrating the SchrSdinger equation numerically 
outwards from the origin. Comparison of the numerical solution with the asymptotic form of the 
radial wave function that appears in (13.65) permits a determination of the phase shift. 
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From (13.77) 


Since 



k Q a = .jk 2 a 2 + 


2 fiVffi 2 

h 2 


then for sufficiently small ka 


k 0 a = 



and the total (S-wave) cross section is independent of energy. 


(13.93) 


(13.94) 


(13.95) 


RESONANCES 

There is a significant exception to this independence of the cross section on energy. 
Suppose that the quantity y/2 /j.V 0 a 2 /H 2 is slightly less than n/2. Then as the energy 
increases, k a a, as given in (13.94), can reach the value of n /2. In this case, tan k$a 
is infinite, and therefore we can no longer assume that the right-hand side of (13.91) 
is small, even for small ka. In fact, at the value of the energy when k 0 a — n/2, 
tan (ka 4- <5 0 ) = oo and, consequently, ka -f So = n/2, which implies S 0 = n/2 since 
we arc presuming ka <SC_ l. 7 Thus 

5 in 2 S 0 £^=W (jlj) (13.96) 

Here w'e see a pronounced dependence of the total cross section on energy. Also 
notice that the magnitude of the total cross section is much larger than that given in 
(13.93). Instead of being a small quantity, the phase shift S () = n/2. 

What is causing this unusual behavior? If you return to our discussion of the 
bound states of the finite spherical well in Chapter 10, you will recognize the 
condition 


2/xVq a 2 ^ £ 
h 2 ~ 2 


(13.97) 


7 Since the phase shift 5 0 starts at zero, we are assuming that the condition <5 0 = n/2 is the first 
resonance condition that we reach as this phase shift grows. 
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as the condition that the well has a bound state at zero energy. Thus for a potential well 
'atisfying (13.97), the energy of the scattering system is essentially the same as the 
energy of the bound state. In this case, the incident particle in a scattering experiment 
w ould like to form a bound state in the well. Since the system has a small but positive 
energy, the bound state isn’t stable. However, pulling in the wave function in an 
attempt to form such a bound system dramatically changes the scattering behavior. 

Although it is more difficult to determine the phase shifts and cross sections 
in such a nice analytic form as (13.92) for higher partial waves, it is easy to see 
physically why these higher partial waves may exhibit resonant behavior, with large 
"bumps” in the partial cross sections. These bumps arise when the phase shift goes 
through odd multiples of jt/ 2. Figure 13.11 shows a plot of the effective potential 
energy 

•» 

Veff(r) = V(r) + A/ + U ! r (13.98) 

2/j.r 1 

for the spherical well. A particle with energy E greater than zero but less than the 
height of the barrier can tunnel through the barrier and form a metastable bound state 
in the well. This state is metastable (and not stable) because a particle “trapped” 
inside the well can also tunnel out. Thus if the energy of the beam in a scattering 
experiment is tuned to one of the energies of these metastable states, there is an 
enhanced tendency for the particle to get stuck in the well. The system then loses 
track of the mechanism by which the bound state was formed, that is, in particular it 
may lose track of the direction of the incident beam. When this metaslable state 
decays, it emits the particle with the characteristic angular distribution for that 
particular decay mode. 

A convenient way to parametrize the behavior of the phase shift in the vicinity 
of a resonance leads to the famous Breit-Wigner formula. We assume the phase 
shift S[ of the /th partial wave goes through n/2 at an energy £ 0 : 

S,(E 0 ) = | (13.99) 


We next make a Taylor series expansion of cot in tire vicinity of the resonant 
energy: 

cot 5 ; (£) = cot 8,(E 0 ) + ( 1 ) (E — E 0 ) + • * • 

V dE / E=Eq 


( 1 ds,\ 

\sin 2 8[ dE J £= £ o 


(E — E 0 ) + • • • 


(13.100) 


Defining 


/ ^/(E) \ = 2 

l dE J E=Eo r 


(13.101) 
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V 



Figure 13.11 The centrifugal barrier combines with the potential well to produce an 
effective potential that can produce a metastable state, as indicated. If the energy of the 
incident beam coincides with the energy of one of these metastable states, a resonance in 
the scattering cross section can occur. 


we obtain 


cot S,(E) = —p(£ — E 0 ) H- 

Finally, we can express the function a,{k) [see (13.71)] as 

e iS i 

a/(k) = — sin S / 
k 

_ 1 _ 1 _ 

k cot Si — i 

s l_1_ 1 r /2 

” k -(2/ D(£ - E 0 ) -i k (E — E 0 ) + iT/2 


(13.102) 


(13.103) 


Repeating the steps leading to (13.74). we find that the total cross section for the /th 
partial wave in the vicinity of a resonance is thus given by 


_ 4tt „ r 2 /4 

n ‘ = k 2 { ~ + \e-E 0 ) 2 + r 2 /4 


(13.104) 


As an example, Fig. 13.12a shows a strong resonance in n + -p scattering with 
a peak at roughly 190 MeV of incident pion kinetic energy. This resonance, known 
as the A(1232) because the center-of-mass energy of the resonance at the peak is 
1232 MeV, has a full width at half maximum T of 110-120 MeV. Figure 13.12b 
shows the P-wave phase shift, which reaches jt/ 2 at the resonance peak. The reso¬ 
nance is thus formed in the / = 1 channel. In fact, the intrinsic spin of the A(1232) 
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(a) (b) 

Figure 13.12 (a) The total cross section for n + -p scattering with pion kinetic energies up 
to 400 McV. Adapted from W. R. Frazer, Elementary Particles , Prentice-Hall, Englewood 
Cliffs. NJ, 1966. (b) The P-wave phase shift for rt + -p scattering. Adapted from L. D. 
Roper, R. M. Wright, and B. T. Feld. Phys. Rev. 138. B190. 1965. 

is j = From (13.104), we expect cr /=1 = \2n/k 2 when £ = £ 0 . However, when 
we add the orbital angular momentum / = 1 of the pion-proton system to an intrinsic 
spin s = 4 of the proton, we can from a total angular momentum j — \ (two states) 
as well as j — 5 (four states). Thus of the six total angular momentum states that 
are generated (presumably incoherently) in the collision, only the four j = | states 
can produce the resonance. If we assume the scattering cross section for the j = in¬ 
states is negligible in comparison with the j — | resonance scattering in the vicinity 
of the peak, the jv^-p total cross section at the resonance should be | of the P-wave 
peak cross section, that is 8 jr/A 2 , which is about 190 mb, in good agreement with 
the observed value. 

Finally, we return to our discussion of S-wave scattering for the finite potential 
well and ask what happens if we increase the energy of the beam so that the phase 
shift <5 0 -» 7 r, as illustrated in Fig. 13.13. Then the wave function outside the well, 

u = C sin(Ar + it) = — C sin kr (13.105) 

is the same as the wave function outside the well with zero phase shift [see (13.89b)] 
up to an overall phase. Moreover, since sin <5 0 = 0, the S-wave partial cross section 
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II 



Figure 13.13 The wave function w(r) for S-wave scattering for a potential well at an 
energy such that the phase shift = rr. The dashed curve shows it when V = 0. 


vanishes (a I=0 = 0). This effect, known as the Ramsauer-Townsend effect , 8 is 
clearly seen in the very low scattering cross section from noble gases at about 0.7 eV. 
These rare-gas atoms have a potential well with a sharply defined range, and it is 
possible at low energies to have S 0 = tt , with all other phase shifts negligible, leading 
to essentially perfect transmission of the incident wave. 


13.6 Summary 


The differential scattering cross section is given by 

^- = \f{d,(P )\ 2 (13.106) 

ail 

where the scattering amplitude f(6, 0) determines the angular dependence in the 
asymptotic form for the solution to the Schrodinger equation: 

0 -► Ae ikz + Af{9,4>)— ( 13 . 107 ) 

/--►oc r 

At “high” energies, the scattering amplitude can be determined through the Born 
approximation 


/(0,0) = --i l — f rfVV(rV q-r ' 

2jr h- J 


(13.108) 


8 Unfortunately, a different Townsend. 
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which is (up to constants) the Fourier transform of the potential energy with respect 
to the vector q = k, — k,. 

At “low” energies, on the other hand, the scattering amplitude for scattering from 
a central potential can be determined from the partial wave expansion 

~ js, 

f(6) = )(2l + 1)— sin 5, F, (cos (9) (13.109) 

to k 

leading to a total cross section 


A 

(T = -4 5 ^( 2 /+ 1 ) Sin 2 5, (13.110) 

k /=o 

The phase shifts 8 h which are generated when the particular partial waves interact 
with the potential, enter in the asymptotic expression for the wave function 


V/(r)- *■ 

r—►oc 


f 1Cl ^r- l „2 + 8 l m pi(cose) 

i=o kl 


(13.111) 


A useful way to determine the phase shifts is to solve the Schrodinger equation, 
either numerically or analytically, for the radial wave function u — rR, which obeys 
the one-dimensional Schrodinger equation 


Fr d-it 
dr- 


1(1 + 1 )/r 
2 /xr 2 


li + V(r)n = Eu 


(13.112) 


and compare the asymptotic form of the solution with (13.111). 


Problems 


13.1. Use the three-dimensional time-dependent Schrodinger equation 


-> 3 \[f 

2/t at 


to establish that the probability density \(/*(r, t)xlr( r, t) obeys the local conservation 
law 

f(*V) + v-j = o 

3/ 

where 

j=~ (rv'i'-i'vr) 

2m 


What w'ould happen to your derivation if the potential energy V were imaginary? 
Is probability conserved? Explain. In nonrelativistic quantum mechanics, such an 
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imaginary potential energy can be used, for example, to account for particle absorp¬ 
tion in interactions with the nucleus. 


13.2. Evaluate the probability current for the scattered wave 

V'sc-*• A/(0, d>) — 

r~* cc r 

and show lhat 

• //& , . ,9, 

Jsc-► — r|A|-|/|-U r 

r-»oo 

where u,. is a unit vector in the direction of the radius. 

13.3. Show that at low energies (ka -*■ 0) the requirement (13.48) for the validity 
of the Bom approximation becomes 


If-'-' 


n 2 


« i 


where V 0 is the order of magnitude of the potential energy, a is the range of the 
potential, and we have neglected constants of order unity . By comparing this result 
with (13.49). argue that if the Bom approximation is valid at low energies, it works 
at high energies as well. 


13.4. Using the Bom approximation to determine the one-dimensional reflection 
coefficient R for a potential energy V (x) that vanishes everywhere except in the 
vicinity of the origin: 

(a) Show that we can write the solution to the one-dimensional Schrodinger 
equation in the form 


V>(x) = Ae ikx + J (lx G(x, x')~V(x')\l/(x’) 


where 


o2 

—G(.v. x') + k~G(x, x') — S(x - x') 
dx- 


(b) Since G satisfies a second-order differential equation. G must be a continuous 
function and, in particular, it must be continuous at.v = x'. By integrating the 
differential equation for G from just below to just above .r = .v', show that the 
first derivative of G is discontinuous at x — x' and that it satisfies 
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Then show that one solution for G is given by 


G — 


1 c ik(x-x') 

2ik 

_L e -ik(x-x') 
2 ik 


x > x' 
X < x' 


(c) Substitute this expression for G into the equation for x/s in (a). Show that in 
the Bom approximation 


xfr 


Ae ikx + Ae 


noo 2ikx’ 

- tkx / dx' — 

J -oo 2 i 


2m 


2 ik n 2 


V(x') 


and that consequently 


R 


(d) For the potential barrier 


/ OO 

dx' e 2ikx 'V(x') 

■00 


V(x) = 


Vq 0 < x < a 


0 elsewhere 
the exact reflection coefficient is given by R — 1 — T with 


T = 


1 + \V 2 /4E(E - V 0 )] sin 2 V(2 m/h 2 )(E - K 0 ) a 

Show that the exact result for R in the limit V a /E <<C 1 agrees with the result 
of the Bom approximation. 


13.5. In the initial Rutherford scattering experiment Geiger and Marsden used a 
panicles with an energy of 5 MeV. Choose a reasonable value for m 0 and determine 
the range of angles for which (13.56) should be valid. Suggestion: Write m 0 = 1/a, 
where a is a characteristic screening length. How large should a be? Note: Interaction 
with the electrons in the atom produces a deflection on the order of 10 -4 radians. 


13.6. Use the Born approximation to determine the differential cross section for the 
potential energy 



a here C is a constant, corresponding to a 1 /r 3 force. Note: The result depends on 
. so it is not the same as the classical result. 


13.7. Use the Bom approximation to determine the differential cross section for the 
potential energy 
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V(r) = V 0 e~ r ' a 

13.8. Our goal is to establish (13.60), which, with the aid of (13.59), can be put in 
the form 

OC 

£ ikr cos 0 = J- i^An{ll + 1) J,(kr)Y in (9) 

1=0 

(a) Explain why the expansion of the plane wave must be of the form 

00 

e ikrcos o = J2 c , Mkr) y lo (9) 

1=0 

(b) Use the fact that ■ 

1 / L_V 

|/, 0) - —= — |/, /) 

V(27)! \ ft / 

(see Problem 9.18) to show 

cMkr) =7mJ d ° r ‘‘ | [r" ( f s - cot »£)]' 

= —L= [ <m Y* U-W sin' 9 - C l— e 'krcosO 

V(2 Ty. J u L d (cos 9) 1 

where the last step follows from the explicit form (9.142) for the raising 
operator in position space. 

(c) Use the explicit form (9.146) for Y u (9. efr) to express this result in the form 

c,JMr) - (ftri 

(d) Finally, isolate c ; by evaluating this expression as r —» 0. Hint: For small r, 
ji(kr) behaves as (kr) 1 /(2l + 1)!!. where (21 4- 1)!! = (21 4- 1)(2/ — 1) • ■ • 5 ■ 
3- 1. 

13.9. A particle is scattered by a spherically symmetric potential at sufficiently low 
energy that the phase shifts 8/ = 0 for / > 1 (that is, only S 0 and <5, are nonzero). Show 
that the differential cross section has the form 

— = A + B cos 9 + C cos 2 9 

dQ. 

and determine A, 6, and C in terms of the phase shifts. Determine the total cross 
section a in terms of 4, R, and C. 
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13.10. Evaluate the F-wave phase shift S, for scattering from a hard sphere, for which 
the potential energy is given by 

' oo r < a 

V(r) = 

U r > a 

Express your result in terms of j\(ka) and ii\(ka). Use the leading behavior of j,(p) 
and ??|(p) for small p to show that <5, -»• —(ka ) 3 /3 as ka -*■ 0, and thus (5, can indeed 
be neglected in comparison to <5 0 [sec (13.81)] at sufficiently low energy. 


13.11. Compare the Bom approximation result for the total cross section for scatter¬ 
ing from the potential well 


V (r) = 


-V 0 r <a 

0 r > a “* 


with that obtained by using S-wave phase shift analysis. Using the condition for the 
validity of the Bom approximation at low energy (see Problem 13.3), show that the 
two approaches are in agreement when the Born approximation is valid. 


13.12. Consider the spherically symmetric potential energy 


2 nV{r) 

h 2 


— y8(r - a) 


where y is a constant and <5(r — a) is a Dirac delta function that vanishes everywhere 
except on the spherical surface specified by r — a. 


(a) Show that the S-wave phase shift <S 0 for scattering from this potential satisfies 
the equation 


Um(ka 4- S 0 ) — 


tan ka 

1 + (y/k) tan ka 


(b) Evaluate the phase shift in the low-energy limit and show that the total cross 
section for S-wave scattering is 


a 


Area 1 


ya 


1 + ya 


13.13. 

(a) Determine the differential cross section do/dSl in the Bom approximation 
for scattering from the potential energy 2 nV(r)/h 2 = yS(r — a) (see Prob¬ 
lem 13.12). Show the explicit dependence of do/dSl on 0. 

(b) Evaluate da/dQ in the low-energy limit. Show that the differential cross 
section is isotropic. What is the total cross section? 

(c) Use the condition for the validity of the Bom approximation at low energy 
(see Problem 13.3) to establish that your result in (b) for the total cross section 
agrees with that given in Problem 13.12 in the appropriate limit. 
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CHAPTER 14 

Photons and Atoms 


In this chapter we turn our attention to a quantum treatment of the electromagnetic 
field. After analyzing the Aharonov-Bohm effect which demonstrates the unusual 
role played by the vector potential in quantum mechanics, we use the vector potential 
to show that the Hamiltonian for the electromagnetic field can be expressed as 
a collection of harmonic oscillators. The raising and lowering operators for these 
oscillators turn out to be creation and annihilation operators for photons, the quanta 
of the electromagnetic field. This quantum theory of the electromagnetic field is 
then used to determine the lifetimes of excited states of the hydrogen atom using 
time-dependent perturbation theory. 


14.1 The Aharonov-Bohm Effect 


Within classical physics, the vector potential A is simply an auxiliary field that 
is introduced to help determine the physical electromagnetic fields E and B. In 
particular, Gauss’s law for magnetism. 

V • B = 0 (14.1) 


implies that we can write 


B = V x A (14.2) 

since the divergence of a curl vanishes. Moreover, when expressed in terms of the 
vector potential, Faraday’s law. 


„ ,, 1 3B „ 

V x EH-=0 

c 3 1 


(14.3) 


483 
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becomes 


which implies 



„ 19A 

E H-= — V<p 

c Bt 


(14.4) 


(14.5) 


since the curl of a gradient vanishes. 

We can always alter the function A by adding to it the gradient of a scalar 
function y: 


A—*-A + Vy * (14.6a) 


This transformation does not affect the magnetic held (14.2), and the electric field 
(14.5) will also be unaffected provided 

cp —*■ (p — (14.6b) 

C Bt 

as well. The transformation specified by (14.6) is known as a gauge transformation. 
Although the potentials (p and A are altered by a gauge transformation, the "physical'’ 
electric and magnetic fields are not. 

We can see the special role the vector potential plays in nonrelativistic quantum 
mechanics by considering the Aharonov-Bohm effect. As background, first consider 
a long solenoid carrying a current. The magnetic field inside the solenoid is uniform 
and has the magnitude B 0 . From the definition (14.2) of the vector potential, we find 

j (V x A) -dS = J BdS (14.7) 

for the flux of the magnetic field through any surface S. We can take advantage of 
Stokes's theorem to convert the surface integral on the left-hand side of (14.7) to a 
closed line integral: 


f 


A 



B-r/S 


(14.8) 


For the solenoid we take as our path a circle of radius p centered on the axis of the 
solenoid, as shown in Fig. 14.1. From the azimuthal symmetry of the solenoid, the 
magnitude of the azimuthal component of A must be the same everywhere along 
the path. Thus we find for a circular path of radius p that is less than the radius R of 
the solenoid 

(£ A • dr = A2np — B a np 2 p<R (14.9) 


Page 501 (metric system) 



14.1 The Aharonov-Bohm Effect | 485 


Bo 



Figure 14.1 A line integral for evaluating the vector 
potential for a solenoid. 


or 


-( 


R 0P 

2 


P<R 


Outside the solenoid, the integral for the magnetic flux is given by 




B • dS = B 0 nR 2 p>R 


(14.10) 


(14.1 I) 


since the magnetic field vanishes outside a long solenoid. Thus from (14.8) we find 


A =(^-) u 0 P>R (14.12) 

We can check our results (14.10) and (14.12) by using the gradient in cylindrical 
coordinates, 


„ 3 18 3 

V = U„-h U.*-b u z - 

p 3 p %3<£ 3z 


(14.13) 


to evaluate the curl of the vector potential and to verify that it yields a uniform 
magnetic field B 0 within the solenoid and zero field outside the solenoid. Thus 
outside the solenoid the magnetic field vanishes while the vector potential does not. 

Let’s now reconsider the double-slit experiment for particles wfith charge q with 
an additional feature. Suppose that directly behind the barrier between the two slits 
we insert a small, very long solenoid, as indicated in Fig. 14.2. Recall that the 
intensity at an arbitrary point P on the screen arises from the interference between the 
amplitude i//, for the particle starting at the source point S to arrive at P after passing 
through one of the slits and the amplitude i jr 2 for it to arrive at P after passing through 
the other slit. Of course, as we saw in Chapter 8, there are many neighboring paths 
for both paths 1 and 2 that have essentially the same phase and therefore contribute 
coherently w'hen evaluating the path integral (8.28). 
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Figure 14.2 The double-slit experiment 
with a long solenoid inserted behind the 
barrier. The closed contour formed by 
following path 1 from the source S to 
the point P on the distant screen and then 
back to the source along path 2 includes 
the magnetic flux of the solenoid. 


Surprisingly, the phase for each path that contributes to the path integral is 
modified by the presence of the solenoid, even though the magnetic field may vanish 
at all points along the path. According to (E.6), the Lagrangian for a particle of 
charge q picks up an additional term q A • v/c, and thus the amplitude to take path 1 
is modified by 


iff i -> xj /1 exp 


-) f 

lie/ Jtr, 


A • v dt 


(14.14) 


where f ( ) is the initial time at which the charged particle leaves the source and t' is the 
final time when it reaches the point P. Since v dt — dr, we can express (14.14) as 


xjfy —*■ xj/ i exp 


/(f) f 

\ncj J path 1 


A • dr 


(14.15) 


while the corresponding expression for the amplitude to take path 2 is modified by 


-*■ ’A 2 e xp 


)/ ■ 
/ J path 2 


A • dr 


(14.16) 


Thus the amplitude to reach the point P by passing through either of the slits is 
given by 


Vt + V'2 -*■ i/t ex P 


iff) f 

V lie / J pmh 1 


A • dr 


exp 

i(±)[ A-dr 
\fic / J path 2 

i//^ exp 

exp 

A ■ dr 

\ He J J path 2 

j Va exp i 


+ 1 2 exp 

ih)L 

'(£)/ 


A ■ dr 


(±)f 

\hc J J path 2 

+ t2 


J path 1 

A ■ dr 


A • dr 


path 2 


A • dr 


+ ^2! 
(14.17) 


In the last step we find that the relative phase between t/q and xjr 2 is proportional to 
the dosed line integral of the vector potential going from the source to the point P 
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along path 1 and back to the source from point P along path 2. Talcing advantage of 
(14.8), we see that 

i//, + ex P 

(14.18) 


iff) f 

\ He J J path / 


A • dr 


\j/\ exp 


fie 


B • dS 


+ t//? 


where the relative phase has now been expressed in terms of the flux of the magnetic 
field through the closed path. The presence of this relative phase will cause a shift 
in the interference pattern as the magnetic field in the solenoid varies. For example, 
when 


/ B-dS^2nn 

he J 


n — 0, 1,2,..*'! 


(14.19a) 


the pattern will be the same as without the magnetic field present, while when 


— / B ■ r/S = (2n + l)rr n = 0,1,2,... (14.19b) 

he J 

the position of the minima and the maxima in the pattern will be interchanged. 

This is a rather startling result. Classically, we would expect that the particle 
must follow either path 1 or path 2. Along each of these paths the magnetic field B 
vanishes everywhere. How then does the charged particle know about the magnetic 
field within the solenoid? While the classical particle responds to the magnetic field 
only where the particle is-—that is, locally—the quantum particle has a probability 
amplitude to take both paths. Since the solenoid produces a vector potential that 
changes the phase for each of the paths, in some sense we might say that the particle 
compares the phase that it has picked up along the two different paths and responds 
directly to the phase difference. Notice that this relative phase difference depends on 
the magnetic flux passing through the surface bounded by the paths and not on the 
vector potential itself. Thus the phase difference is a gauge invariant quantity that 
may in fact be measured. 1 Even though the phase difference depends on the magnetic 
field B and not on vector potential A directly, the Aharonov-Bohm effect suggests 
that the particle learns about the magnetic field by responding to the vector potential 
along the path. 


1 A number of experiments confirming the prediction (14.19 ) have been carried out. The first 
was done by R. G. Chambers, Phys. Rev. Lett. 5,3 (1960). A more recent experiment is that of 
A. Tonomura et al., Phys. Res'. Lett. 48. 1443 (1982). Many people found Y. Aharonov and 
D. Bohm's 1959 paper [Phys. Rev. 115. 485 (1959)] difficult to believe, which surprised Bohm, 
for he knew it would be much more surprising if the experiments did not confirm their prediction, 
since that would mean that quantum mechanics itself was wrong. 
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14.2 The Hamiltonian for the Electromagnetic Field 


Given the results of the preceding section, we should not be surprised that our 
discussion of the quantum mechanies of the electromagnetic field starts with the 
vector potential. We have already taken advantage of two of Maxwell’s equations, 
(14.1) and (14.3), to introduce the scalar - potential q> and the vector potential A. 
Expressed in terms of these potentials, the remaining two equations, Gauss’s law, 

V-E = 4 jrp (14.20) 


and Ampere’s law. 


are given by 


and 


v, u 4jr • 1 3E 

V x B = —j H — 

c c at 


—V 2 <p — - — V ■ A = 4np 
c dt 


4n. 1 9 / „ 1 9A\ 

V(V-A)-V-A = —J+-— -V<p-- — 
c c dt V c dt J 


(14.21) 


(14.22) 


(14.23) 


respectively. 

A physically transparent gauge for analyzing the electromagnetic field is the 
Coulomb gauge, which takes advantage of our freedom to make gauge transfor¬ 
mations to impose the constraint 


V ■ A = 0 (14.24) 

on the vector potential. The reason for calling this gauge the Coulomb gauge becomes 
apparent when we make the replacement (14.24) in (14.22). Then the scalar potential 
satisfies the equation 

V 2 <p = -4 np (14.25) 

for which the solution can be expressed as 

<p(r, t) = f < 14-26 ) 

J |r — r'| 

This is just the usual expression for the scalar potential arising from a charge 
distribution p in electrostatics. Notice, however, that we have not restricted ourselves 
to static charge distributions and, in fact, the value of the scalar potential at the 
position r at time t is determined by the charge distribution p{ r', t) at the same 
time t. Thus there are no retardation effects arising from the finite time |r — r j/c it 
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takes for a change in the charge distribution at r' to produce a change in the field at 
r. The absence of these effects in the scalar potential in this gauge emphasizes that 
physical effects depend directly on the electric and magnetic fields and not on the 
potentials, which can always be altered by a gauge transformation. 

In Appendix E we find that the nonrelativistic Hamiltonian for a particle of mass 
m and charge q interacting with the electromagnetic field is given by 


H - 


(p - gX/cY 
2m 


+ q<p 


(14.27) 


If we set A — 0, the remaining terms are the kinetic energy and electrostatic potential 
energy that we have included in our initial treatment of systems such as the hydrogen 
atom. We will examine the effect of the interaction of the charged particle with the 
vector potential in the next few sections. First, however, we need to examine the 
Hamiltonian for the free electromagnetic field, that is, the field energy in the absence 
of charges and currents (p = 0, j = 0). In this case, the electromagnetic field energy 
is given by 


#e&m = ^ / d\ (E 2 + B 2 ) (14.28) 

See Problem 14.1. According to (14.26), with no charges, cp — 0, and from (14.5), 
E = —(l/c)(9A/3t). Therefore, 




E&M 


= ±f 

8;r J 


d 3 r 


1 9A\ 2 _ 2 

-— J + (V x A) 2 

c 9/ / 


(14.29) 


Note that we have not put a hat on the Hamiltonian because we are treating the 
electromagnetic field as a classical field. In fact, our goal of this section is to begin 
to see how we can make the transition from a classical theory to a fully quantum 
treatment of the electromagnetic field. 2 

Without charges and currents, the equation of motion (14.23) for the vector 
potential in the Coulomb gauge is given by 

-^^-V 2 A = 0 (14.30) 

c 2 dt 2 

A specific solution to this wave equation is given by the plane wave 

A = A 0 e i(kr - a " ) (14.31) 


2 Our approach follows that of J. J. Sakurai, Advanced Quantum Mechanics, Addison-Wesley, 
Reading, MA, 1967. 
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with u> = kc. In addition, the Coulomb gauge condition (14.24) imposes the con¬ 
straint 


V • A = /k • A 0 e' (k ' r-O,f) = 0 (14.32) 

Thus k • A 0 = 0. indicating that the wave is transverse to the direction of propagation, 
which is in the direction of k. For this reason, the Coulomb gauge is often called the 
transverse gauge as well. 

What is the most general solution to the wave equation (14.30)? It can be obtained 
by superposing all the different plane wave solutions. In general, this superposition 
takes the form of an integral over all possible values of k. However, in our discus¬ 
sion of the quantum properties of the electromagnetic field it is somewhat easier 
conceptually to impose boundary conditions on the solutions to the wave equation 
that dictate that the allowed values of k take on discrete rather than continuous values 
and the superposition is in the form of a sum rather than an integral. One convenient 
way to do this is to work in a cubic box of length L on a side subject to periodic 
boundary conditions. For example, we require 

e ik x x _ e ik x (*+D (14.33) 

The condition (14.33) and the corresponding conditions imposed on k y and k z are 
satisfied provided 


k x L = 2tt n x k v L — 2n n y 

k z L = 2nn z n x ,n y ,n z = 0, ±1, ±2, ... (14.34) 

Notice, for example, that as n x takes on all positive and negative integral values, 
k x runs from — oo to oo. Moreover, the separation A k x between adjacent modes is 
given by A k x = 2n/L. Thus as L —* oc and the volume V of the box in which we 
are working approaches infini ty, the allowed values of k x approach the continuum 
that we would have expected had we chosen to work in the infinite volume limit 
initially. 3 It should be emphasized that this discrete set of solutions is a result of our 
having imposed certain boundary conditions on the solutions to the wave equation. 
We are still doing strictly classical physics. 


- Working inside a box may seem quite unphysieal. especially a cubic box. which we have 
chosen for mathematical convenience. However, we will see that any physically measurable 
q uantity is independent of the volume of the box, and thus we can let L —*■ oo without changing any 
• ur results. Given that the volume of the universe itself may be bounded, the idea of introducing 
>uch a box into our calculations may not seem so strange after all. Moreover, if your universe 
> similar to that of a creature living on the two-dimensional surface of a balloon, the periodic 
t> undary conditions that we are imposing may seem almost natural as well. We can take comfort 
in the fact that the effects we are calculating do not depend on either the size or the shape of the 
box. 
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With k taking on discrete values, the most general solution to the wave equation 
is given by 


A(r. r) = £ ( c k v e(k, s) 


k.r 


“7^ 


+ c*^(k,s) 


—(<k-r—rur)' 




(14.35) 


The vectors e(k, s) are unit vectors indicating the direction, or polarization, of the 
vector potential for each value of k. Note that the requirement that A satisfy the 
gauge condition (14.24) reduces to the condition that 


k • <?(k, s) = 0 


(14.36) 


Thus the polarization vector elk, s) is perpendicular to the direction of k. For any 
particular k, there are two linearly independent vectors that satisfy this condition. 
We indicate these two vectors by having s take on the values 1 and 2 in the sum 
over s. For example, if k points in the z direction, we can choose elk. 1) to be a unit 
vector in the x direction and elk. 2) to be a unit vector in the y direction. Since k 
may point in an arbitrary direction, the unit vectors elk. 1) and elk. 2) will not. in 
general, lie along the jc and y axes, respectively. It is. however, convenient to choose 
the set of vectors (e(k, 1), elk. 2). k/|k|) as a right-handed set of orthogonal unit 
vectors. The factors c k f in (14.35) can be considered as coefficients in the expansion 
allowing lor arbitrary amplitudes for each of the plane waves. We have added a term 
involving c k s to ensure that the classical vector potential is a real field. 

We are now ready to evaluate the energy (14.29) of the electromagnetic field. A 
typical term comes from the electric licld energy, w'hich is given by 


-i 

8 tt J c dt c dr 


e i,k r —. no . 


a -i(k-r-cui i• 


=s / d>r £ +s) ~~r 

i(k'-r— io'i) ' 


E 

k'.j' 


-ico’ , , e ,{k ' r 0J ' n ico' , , , s e~ 

-c k / t 'g(k , s )- — -1-c... .elk . s ) - — 

C ’ y/V C y/V 


(14.37) 


Note that we have to sum over all possible values of k and s tw ice. each independently 
of the other, since the vector potential appears twice. In evaluating (14.37), it is 
convenient to first carry out the integral over all space. Here we can take advantage 
of the orthonormality relation 


/ 


d\ 


y/V y/V 5kW 


(14.38) 
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i see Problem 14.2). You can now see why we inserted 1 /y/V factors explicitly in 
the expansion. Thus a sample term from (14.37) is given by 


-ilk'-r-a/l )' 


1 f j3 v—\ (—id) n gHk-r-*)\ ( ic0 ' 

to J “ ' 5 (T*-**• S) ^V ~>' g (t^ "’> VF 

= jjf D XI (-^<k..(')«fli.»)) • (^‘k'.,-(')«(k', s')) Vk' 


^ E £ ^• ( k.v c k,, e ( k ’ *) ■ £ ( k - •*') 


k.s 


I IT - 1 \ ^ 0)1 * S J 6j2 * 

~ 8* ^^ f 2 C kAA4- 8jr L C 2 c k., c k., 


(14.39) 


k..v 


k,.v 


There are actually four such terms present in (14.37), two of which are time depen¬ 
dent. However, when we add the terms arising from evaluating the magnetic field 
energy to those arising from calculating the electric field energy, we find that the 
total energy is simply 


H = 


t 2 

L * 

9 , 2 —> ,2 c k...P k,5 


(14.40) 


The time-dependent pieces from the electric and magnetic field energies have can¬ 
celed, just as we would expect, since the total Hamiltonian for a closed system should 
be time independent. 

Unless your name is Dirac, this Hamiltonian may not look familiar. However, 
following Dirac, we can make the underlying physics more apparent by the following 
nifty change of variables: 


<?k,, = - 4 = (c M + c‘ ) Pk , s = (c k - c* ) (14.41) 

cV4rt c\f An 

in which case we find 



I hus we sec that formally the electromagnetic field can be considered as a collection 
: independent harmonic oscillators. This fact is often used as the starting point for 
a derivation of Planck’s blackbody spectrum. In that approach, the electromagnetic 
energy density is determined as the number of modes (oscillators) in a particular fre¬ 
quency range multiplied by the average energy of each oscillator. The key ingredient 
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in resolving the classical Rayleigh ultraviolet catastrophe, which arises from giving 
each oscillator an average energy of k$T, is to restrict the allowed energies of each 
oscillator to the discrete values that we determined in Chapter 7. 

14.3 Quantizing the Radiation Field 


We are now ready to “turn on" quantum mechanics in our treatment of the elec¬ 
tromagnetic field. We assume that the variables q k s and /; k , should be operators 
obeying the commutation relations 


Pk'jl = i 

just as for the three-dimensional harmonic oscillator for which 

Cl At 1 1 2-2 , A V , I 2*2 . Pz . 1 2*2 

H = — — + -ma> x -- -t- -mco y H- — + -mco z 

2m 2 2m 2 2m 2 


(14.43) 


(14.44) 


with [jf, p x \ ~ [y, p v ] = [z. /).] = ih. and [.v, p y ] — 0. and so on. The commutation 
relations (14.43) seem a natural step in our treatment of the Hamiltonian (14.42), 
although, unlike (14.44), (14.42) is an infinite collection of independent oscillators. 4 
Moreover, from the relations (14.41) relating the variables t/ k( and p k s to the 
coefficients c k( and c£ s in the expansion of the vector potential, we see that if c) kv 
and /3 k v are operators, so are the c k t ’s. 

The natural operators for analyzing the harmonic oscillator are the raising and 
lowering operators, which for the Hamiltonian (14.42) are given by 

^= M (*■• + j*-) = M («“ ■ z hi ) <i445) 

In terms of these operators. 



A comparison of these equations with (14.41) suggests the replacements 


l2Tth „ * FbtK 

C k* * C V ° k s C k,.s C \l , . fl k.j 

V CO V co 


(14.47) 


4 This might be a good point to review Section 7.2 and Section 7.3 on solving the one¬ 
dimensional harmonic oscillator with raising and lowering operators. A more rigorous way to 
introduce these commutation relations in lield theory is to start with a field Lagrangian that yields 
the equations of motion and then to postulate commutation relations for the generalized lield 
coordinates and the corresponding momenta. See, for example, R. Shankar, Principles of Quantum 
Mechanics, 2 nd edition. Plenum, New York, 1980, p. 506. 
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and therefore the classical vector potential A has been replaced by the Hermitian 
operator A, 


A = C \~ (^a> e < k - i-) 

k.j 


J(k-r-a>r) — /(k-r—<wr)' 

+ ^ £ (k,.v) 


Jv 




(14.48) 


Since the vector potential is now an operator, both the electric and magnetic 
fields are operators as well. If we use this expression (14.48) for the vector potential 
to evaluate the Hamiltonian (14.29), which is now also an operator, we obtain 


" = \ E Hw {“kJL + «k. A.,) 

k.t , 

= E h( ° K.A.* + 5) 

k..v V 7 


(14.49) 


where in the last step we have taken advantage of the commutation relations 


L«k.,- 5 k',,J = S k.k >8,s (14-50) 

which follow from (14.43) and from the definitions (14.45) of the raising and low¬ 
ering operators. The reason that the Hamiltonian (14.49) cannot simply be obtained 
from the expression (14.40) for the energy of the field using the replacements (14.47) 
is that in working out (14.40) we assumed that the e k v 's and ’s were numbers, not 
operators that do not commute; thus we did not keep track of the order in which these 
numbers appeared in evaluating the Hamiltonian. The right way to derive (14.49) is 
to go back to the beginning and use the expansion (14.48) for the vector potential 
operator together with the commutation relations (14.50) from the start in evaluating 
the Hamiltonian 





+ (V x A) 2 


(14.51) 


In nonrelativistic quantum mechanics we are accustomed to replacing classical 
variables such as the position and momentum by operators. Now we see in a quantum 
treatment of the electromagnetic field that the field itself becomes an operator, an 
operator that annihilates and creates photons, as we will now show. This transition 
from a classical to a quantum field theory represents a conceptual revolution in the 
way we think about fields. It also indicates the way that quantum mechanics and 
special relativity, the two major cornerstones in the way we view the physical world 
that originated in the twentieth century, are joined together in the form of a relativistic 
quantum field. 
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THE PROPERTIES OF PHOTONS 

The lowest energy state of the Hamiltonian (14.49) is called the vacuum state and 
is denoted by the ket |0). This is a state such that for all values of k and s 

fl|J0)=0 (14.52) 

We can consider the ground state as a direct product of lowest energy states for each 
of the individual independent harmonic oscillators that comprise the Hamiltonian: 

10) = 10k,,,) ® |0 k2 ., 2 > ® |0 k3 ., 3 ) ® ■ • • (14.53) 

The energy of this ground state is determined by 

»lo) = E' to ( s iA, + ^)lo> * 

k.s ' A ' 

= MO) (14.54) 

1 k.t 

where of course a> — |k|c. Thus the ground-state energy £ 0 is the sum of the zero- 
point energies of each of the harmonic oscillators: 

E 0 =^J2 ho} (14.55) 

1 k.v 

Unless there is some cutoff in the theory that limits the number of oscillators with 
arbitrarily high frequencies, this sum diverges because there are an infinite number 
of such oscillators. Nonetheless, it is convenient to treat E () as if it were finite. We 
will see that it is only differences in energy that matter in any case. 5 

If we apply the raising operator a k v for one of these oscillators to the ground 
state, we obtain the state 

a k .,l°) = |0 Ml ) ® |0 k , t , 2 ) ® • • • a k ,J°k..v) ® 1 * • 

= |0 Ml ) ® |0 k2 , J2 > ® • • • |l t ,) ® • • • (14.56) 

For simplicity, we denote this whole state by 

iw=tfLi°> < 14 - 57 ) 

with the understanding that each oscillator except the one specified by the vector 
k and the polarization state s is in the ground state. The energy of this state is 


5 An interesting manifestation of the zero-point held energy is the Casimir effect, in which two 
neutral conducting plates attract each other because of vacuum fluctuations. See the discussion in 
Section 14.8. 
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determined by letting the Hamiltonian act on the ket ] l k s >: 

Alik.). £ w (a* ^ + 5) Um) 


Since 


and 


while 


then 


\ J2 hco ' = E o 


k\s' 


^.1^.) = IW 


^ = 0 k ^ k s ' 


(14.58) 


(14.59) 


(14.60) 


(14.61) 


H\l k<s ) = (E 0 + hco)\l ktS ) (14.62) 

« + 

Thus the energy added to the system by the action of the operator a k t on the ground 
state is Hco. 

We can see that we have added momentum to the system as well. Even classically 
we know that the electromagnetic field carries momentum. In fact, the direction of 
the momentum of the field is the same as the Poynting vector S P = (c/47r)E x B, 
which gives the field energy per unit area per unit time (see Problem 14.1). If we 
place a black disk in front of a light source, as illustrated in Fig. 14.3a, the disk 
will recoil as well as heat up as it absorbs momentum and energy from the field. We 
construct the momentum operator for the electromagnetic field by expressing the 
electric and magnetic field operators in terms of the vector potential: 

P=— [ cPr ExB 

4tt c J 


If we substitute the expansion (14.48) for the vector potential into this expression 
for the momentum operator for the electromagnetic field, we find 

L k„? 

= J2 hk ( a k,A,* + f) = fik 5 kA,* ( 14 - 64) 

k,.v ' L ' k,.v 


-“) x ( vxl ) 


(14.63) 
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Figure 14.3 (a) A black disk recoils as it 
absorbs photons, (b) The rate of rotation 
increases as it absorbs circularly polarized 
photons. 


where in the last step we have used the fact that 

Y fik = 0 (14.65) 

k,j 

since for every value of k in the sum there is a — k to cancel it. Thus, as expected, 
the vacuum state has no momentum: 


P|0} = 0 (14.66) 

Applying the momentum operator (14.64) to the state | l k v ), we obtain 

P|l k , s ) = Y fik 'VA'./'k,*) = f)k IW (14-67) 

k\s' 

Thus the state ] l k s ) has additional momentum /ik as well as additional energy ha> in 
comparison with the vacuum state. Since u> — |k|c, the additional energy fico and the 
magnitude of the momentum h\k\ are related by E = pc, as expected for a particle 
like the photon that moves at the speed of light. We can create a state with n k s 
photons, each with momentum hk and polarization s , by acting n k s times with the 
creation operator: 


(a k 5 )' 1|M 

l»k,,> - - 7 =- 10) (14.68) 

V n k,i- ! 

Recall from the commutation relations of the raising and lowering operators that 

= yj n k,s + 1 \( n + l)k,i) (14.69a) 

and 

4, s l”k,.v> = l(« ~ l)k. s > (14.69b) 
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Thus it is appropriate to call the operator fi k a creation operator for a photon, 
since it increases the number of photons in a state by one. Similarly, we call a k ^ an 
annihilation operator, since it reduces the photon content of a state by one. 

As we discussed in Chapter 2, the s label on the photon state indicates its 
polarization. For example, for a photon traveling in the z direction, the single-photon 
state |l k j) is an .v-polarized photon, while the state | l k 2 ) is a v-polarized photon. 
In particular, the right-circularly polarized state is given by 

|/?> = -^(|l kil )+i|l k>2 » = -^ (4 1 IO> + '«k, 2 IO>) < 14 - 70a ) 

while the left-circularly polarized state is given by 

\L) = (|i k .,> - ;|l k .2» = ^ (<,|0> - /< 2 |0>) (14.70b) 

Using the quantum theory of the electromagnetic field, we can check that these states 
do correspond to eigenstates of angular momentum with eigenvalues h and —h, 
respectively, along the direction of the momentum of the photon by verifying that 


|k| _V2 ( a M |0 > +/a k,2 |0> ) 


= h 


LV2 


7 ~ («k.ll0)+< 2 l°)) 


and 


J k 


-4«,io>--A,2lo>) =-» -4«,H»-.-aJ, 2 K») 


V5 


(14.71a) 


(14.71b) 


|k| Lx/2 

where the angular momentum operator for the electromagnetic field is given by 


i f ,3 (E X B) ,i A nn 

3=1 d rrx- (14.72) 

J 4ttc 

The photon has an intrinsic spin of one, as we also deduced from the behavior of 
the photon polarization states under rotations in Section 2.7. The classical physicist 
knows there is angular momentum in the electromagnetic field from the expression 
(14.72) (without the hats). For example, the disk shown in Fig. 14.3b will start to spin 
about its axis if the electromagnetic field incident on the disk is circularly polarized. 
But of course it is pure quantum mechanics that this angular momentum is quantized 
in units of h. 

Based on our discussion in Section 12.1 on the connection between the intrinsic 
spin of a particle and its statistics, we expect that the spin-1 photon should be a boson. 
This is confirmed by (14.68), which shows that there can be more than one photon 
with momentum hk and polarization s in the same state. This connection between 
spin and statistics actually entered our theory when we chose to make the creation 
and annihilation operators for photons obey commutation relations. In order to see 
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the effect of an alternative way of turning on quantum mechanics in a field theory— 
namely, anticommutation relations—that limits the number of particles that can be 
in the same state to one, see Problem 14.5. 


EXAMPLE 14.1 Calculate the expectation value of E and E 2 in the state 

l«M>- 

SOLUTION The electric field operator is given by 

E = -^ 
c 3 1 

__/ e i(kr-cot ) A ^-i(kr-oj/)\ 

= i 22 |^a k s e(k, s) — -j= -o k t e(k. 5)— -j=—J 

where we have used (14.48) for the vector potential operator A. Since 


<"k.vl 2 kJ' 7 k.v) = \ "k..« + 1 ("k.sK' 1 + Dk.j) =0 


and 


<«kj«k.>k.. f ) = a; <»k.ji(» - Dk.j=o 

we see that 


(n k>J |E|nk.,) =0 


Since E 2 contains terms of the form <5 k s a k s and « k f 3 k t , namely a 
creation operator followed by an annihilation operator and an annihilation 
operator followed by a creation operator, respectively, (n k s |E 2 1 n k s ) does not 
vanish. As our derivation of the electric field energy (14.37) illustrates, we 
need to take into account that the operator E involves a sum over all possible 
modes, namely over all possible values of k and s. If the only terms in this 
sum that mattered were the annihilation and creation operators corresponding 
to the particular k and ,v that label the state \n k s ), we would obtain 


( n k,.ilE 2 |«k.j> = (tfk.,«L 


+ 5 kA S~<s 


«k.. t * 


2i(k-r—co/) 


- KjL*-™-™ 0 ) I«k,) 

= ^7^("k.. t l ( 2 «k.A., + 1 ) K.v> 

= y(^s + j)^ 
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Thus the uncertainty per mode in the electric field strength in the vacuum is 
given by 

y<O k .,|E 2 |O k ,,> - (0 k .JE|0 k .,) 2 = ^jytico 

Therefore, although (O k 5 |E|0 k t ) = 0, there are nonzero vacuum fluctuations 
in the electric field strength. It can be argued that these fluctuations are the 
cause of spontaneous emission of a photon when an atom makes a transition 
from an excited state. We will examine spontaneous emission in hydrogen 
in Section 14.7. 

According to (14.53). the full vacuum state |0) is the vacuum for each 
mode, namely ■* 


|0) = |0 klt4 .,) ® |0 k2 ., 2 > <s> |0 kj ,, } > ® •. ■ 

Thus 

<o|E 2 io> = E v (01 KAy + >) v E « 

k ',j' V 

where £ 0 is the sum of the zero-point energies for each mode, which is 
consistent with 

(0\H\0) = ~ f d i r (0|(E 2 4- B 2 )|0) = £ 0 
07T J 

Clearly, the electric field portion of the Hamiltonian is contributing onc-half 
the total zero-point energy. 


EXAMPLE 14.2 The fact that (n k JE|n k s ) = 0 may be troubling to you, 
since, given the form of the electric field operator as a superposition of 
plane waves, you might have been expecting that the expectation value 
of the electric field in a state of /; k 5 photons should look like a traveling 
wave. Recall, however, that the energy eigenstates are stationary states, stales 
that do not exhibit any time dependence. In Section 7.8 we introduced the 
coherent state, a special superposition of energy eigenstates of the harmonic 
oscillator that we argued is as close as possible to the classical limit of a 
particle oscillating back and forth in a harmonic oscillator potential. Show 
that the expectation value of the electric field in the coherent state 


icf )=e | “ r/2 y 

« k ,.*=0 


or " 15 -' 


i« k ,#> 
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is given by 


3- 

~' T K ° |a|e(k, s) sin(k • r - cot + 8) 
r V 


where a = lev |e ,<5 


SOLUTION Since the coherent state is an eigenstate of the lowering (now 
annihilation) operator fi ks jar) = or|ar) (and consequently (or|<7 k f = {aria 1 *). 


(a|E|ar) = iy/2nhco(a\ ( ^elk. s) 


J(k-T-a>t) 


w 


a k s e(k, s)- 


-i(k‘r—o)t) 

Vv 


a) 


— / %/ 2tt hcoe 


(“ 


““yv - 


-i(k-r— col ) s 


Tv 


= - 2 . 


2tt hoj 


|ct|e(k, s )^7 


r-wf+il) _ g—/(k-r—arf+i) 1 


/ ^ 7T /? CO 

= —2,/ }L ° |a|«(k, s) sin(k • r — cot + S) 


which is just the form of a classical traveling wave. Moreover, the fluctuation, 
or uncertainty, in the electric field for the coherent state is the same as that for 
the vacuum. See Problem 14.8. Just as the coherent state of the mechanical 
harmonic oscillator is a minimum uncertainty state that oscillates back and 
forth in the well like a classical particle, so too the coherent state of the 
electric field has the same uncertainty as the vacuum and propagates like a 
classical wave with a well-defined phase. It is as close to a classical field as 
is possible for any quantum state of light. The output of a laser operating 
well above threshold can best be described in terms of coherent states. As is 
shown in Problem 14.7. the distribution of photons in a coherent state is a 
Poisson distribution with (n k t ) = |or| 2 . 


14.4 The Hamiltonian of the Atom and the Electromagnetic Field 


We are now ready to consider the interaction of photons and atoms. If you examine 
expression (14.48) for the vector potential operator A(r. r), you will notice that both 
the position r and the time r enter as parameters specifying the field. We emphasized 
in our discussion of the energy-time uncertainty relation in nonrelativistic quantum 
mechanics that t is not an operator but rather a parameter used to specify the state. Not 
surprisingly, in relativistic quantum field theory, position and time enter on an equal 
footing. In particular, the position r is no longer an operator but a parameter that, 
for example, we integrate over to express the Hamiltonian operator (14.51) in terms 
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of annihilation and creation operators. In a fully relativistic treatment of charged 
particles such as electrons and positrons, there is also another field—a function of 
r and / called a Dirac field—that is a superposition of creation and annihilation 
operators for electrons and positrons. In determining the full Hamiltonian for the 
interactions of charged particles with photons, the position r of the Dirac field is 
integrated over in the same way it is integrated in determining the energy of the 
electromagnetic lield. 

In the approach that we w'ill follow in this chapter, we will not use quantum 
held theory for the charged particles. It isn’t necessary, since our goal is to treat 
the interactions of photons with atoms, and the physics of the atoms is essentially 
nonrelativistic in nature. Photons, on the other hand, are inherently relativistic and 
thus including them requires a quantized electromagnetic held. If you look back 
to the derivation of the Hamiltonian (14.27) in Appendix E, you will see that the 
vector potential that enters the Hamiltonian in the form p — <r/A(r, t)/c is actually 
evaluated at the position of the charged particle. Thus in order to treat the interaction 
of the vector potential operator A(r, i) with charged particles in a manner that is 
self-consistent with the way that we treat it in evaluating the electromagnetic held 
energy, we need to use the position-space representation of the Hamiltonian for the 
charged particles so that the position r is treated as a valuable rather than an operator. 

For simplicity, we concentrate on the Hamiltonian of the hydrogen atom, includ¬ 
ing the Hamiltonian of the electromagnetic held. The Hamiltonian in the ccnter-of- 
mass frame of the atom is given by 6 


6 The Coulomb interaction of the charged particles actually arises from that portion of the 
electromagnetic field energy due to the scalar potential <p in the Coulomb gauge. The electric field 
energy is given by 


I 


8,t 



K 2 


_ 1 _ 

8 .t 



1 3A\ 2 
c dr ) 


8jt 

1 


8 -t 



_ _ 1OA 

Vip ■ V(p 4 - 2Vtp •- 1 - 

c dt 


—(pV-<p — 2 <p- 


3V • A 


c dt 


+ 



where in the last step we have performed two integrations by parts and assumed that the fields 
vanish at infinity so that there is no contribution at the end points of the integration. Note that 
the middle term in the brackets vanishes since V • A = 0. Finally, taking advantage of Gauss’s 
law (14.26) in the Coulomb gauge, we can write this expression for the electric field energy as 


j / * * + i?/ J ’ r (H£) “ j /*■■'** / + i / d>r (j f) 


For hydrogen p = — e<5-\r - r f ) + eS 2 (r — r ;l ), so that the first term yields the Coulomb energy 
-t' 2 /|r, — r p | of interaction between the electron and the proton, as well as the (infinite) self¬ 
energy of the particles, which we are neglecting. 
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H -> 


1 

2 m e 



1 

2m p 



1 

e 

r 





+ (V X A) 2 


(14.73) 


where the arrow indicates that the atom part of the Hamiltonian is given in position 
space. We have neglected the interaction of the proton with A because m p !>> m e . 
If we wish to include the interaction of the intrinsic spin of the electron with the 
magnetic field, we would obtain an additional contribution to the Hamiltonian of 
the form 

-5 

-^-S e • V x A (14.74) 

2 m e c 

where again we have neglected the interaction of the proton’s magnetic moment 
because of the large mass of the proton. 

Unfortunately, we are not able to determine the exact energy eigenstates and 
eigenvalues of the full Hamiltonian (14.73). The system is just too complex. Thus 
we must resort to a perturbative approach. We express the whole Hamiltonian in the 
form 


H = H 0 + 


(14.75) 


where 


H n 





+ (V X A) 2 


(14.76) 


is the sum of two Hamiltonians: the Hamiltonian for the hydrogen atom without 
interaction with the vector potential (see Section 10.2) and the free Hamiltonian for 
the electromagnetic field, which we examined in the preceding section. Thus we 
know the eigenstates and eigenvalues of H 0 . The perturbing Hamiltonian H { is the 
remainder of the full Hamiltonian (14.73): 


—*■ ——A • -V 4--——V • A H-—xA 2 (14.77) 

2 m e c i 2m e c i 2m e c 2 

The gradient operator in position space acts on a wave function, say for the initial 
state of the atom. Since 

V-A^ = (V-A)tfr+A- V^r = A- V# (14.78) 


because V ■ A — 0 in the Coulomb gauge, we can safely move the gradient through 
the vector potential operator in the second term of the interaction Hamiltonian 
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(14.77). Therefore, this Hamiltonian simplifies to 

Hi-* -A • —V + --~ A 2 (14.79) 

m e c i 2 m e c l 


14.5 Time-Dependent Perturbation Theory 


Our goal is to work out how a state such as an excited state of the atom evolves in 
time. However, since we are not able to determine the eigenstates and eigenvalues of 
the full Hamiltonian H = H () + H b we cannot determine the time dependence of the 
full system by expressing an arbitrary state as a superposition of energy eigenstates, 

\r!r(P)) = J2\E)(E\iK0)) * (14.80) 

E 

and then taking advantage of the time-development operator (4.9): 

|(K0) =e"'^ /fi |^(0)) = ^|£)(E|^(0))e-'' £,/fi (14.81) 

E 

Of course this procedure would also fail if the Hamiltonian itself were time depen¬ 
dent, as would happen, for example, if H { were to vary in time: H l — Such 

a situation arises if the system is subject to an external influence that changes with 
time, such as the spin system in a classical, external, time-dependent magnetic field, 
like the one we examined in Section 4.4. In order to handle cases such as these (see 
the example in this section and Problem 14.10 through Problem 14.12) as well as 
deal with time evolution for the Hamiltonian (14.73), we resort to the techniques of 
time-dependent perturbation theory. 

We begin by expressing an arbitrary state at the initial time t — 0 as a superposi¬ 
tion of the eigenstates that we know, the eigenstates of H 0 : 

h//(0)> = X |£f )(E^mO)) = X c„(0)|£' O) ) (14.82) 

n n 

We then write the time dependence in the form 

W(0> = X c„(0«“ i£ "° >f/fi l< 0) > (14.83) 

n 

If the Hamiltonian were to consist only of H 0 , the c„ would be time independent 
Thus if H ] is “small,” we expect that the time dependence of the c n (?) can be handled 
perturbatively. We will first demonstrate how this works using techniques similar to 
those that we used in Chapter 11, and then we will show how we can obtain the result^ 
to all orders in a more compact, elegant manner (that is especially appropriate for 
quantum field theory) utilizing what is termed the interaction picture. 
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We obtain the equations governing the time evolution of the c n (t) by substitut¬ 
ing (14.83) into the Schrodinger equation 

H\f(t))=ih^-m)) (14.84) 

dt 

yielding 




= ihJ2 


d iE < 0) 

-c„(t) - ~^c„(t) 
cit a 


e-iE^./h |£ ( 0)) 


Taking advantage of the fact that 

4l£, ( , 0) > = £fl^ 0) ) 


(14.85) 


(14.86) 


allows us to simplify the equation to 




-iEfh/R 


di 


c n(‘) 


I) = J2 c n «)e-^>%\Ef>) (14.87) 


where w'e have swapped the left-hand and right-hand sides of (14.85). Assuming 
{Ef\E<®) = S fn , namely the eigenstates of H 0 form an orthonormal basis, w'e can 
project out the time derivative of a particular c n , say c f. by taking the inner product 
of (14.87) with the bra (E ( , 0) |, which yields 


dt n — J 


r(0)\ 


(14.88) 


In general, this is a complicated set of coupled differential equations that is too 
di ff icult to solve exactly since dcj{t)/dt is coupled to each c n (t) for which the 
matrix element (E^li/jlE* 0 ') is nonzero. Consequently, we resort to a perturbative 
approach. 

As in Chapter 11, we insert a parameter A in the Hamiltonian to keep track of the 
order of smallness (H\ —> A H x ) and expand the coefficients c n (t) in a power series 
in A: 


C n (t) = cf + Acf + k 2 cf + • • • (14.89) 

Making these substitutions on both sides of (14.88), we obtain 

4( c ® + x,«» + xVf + ...) 

= -( £>»> + + X 2 4 2 > + ■ ■ (14.90) 

n 
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r he only term of order A 0 is on the left-hand side of (14.90), indicating that 

—cf = 0 (14.91) 

dt 1 

We will assume that at time t — 0 the system is in an eigenstate |£’. 0) ) of Hq. Thus 
the initial conditions for the coupled differential equations (14.88) are 

9,(0) = (14.92) 


The initial condition (14.92) is satisfied provided 


cf = S fi and cf ) (0) =0 for k > 1 


(14.93) 


Collecting the A 1 terms in (14.90), we obtain 


f>OT = -'-T 

n 


dt 


(14.94) 


which can be integrated to yield 


4 '>«) 


—i /V<‘ 

h Jo 


(4 0) -£< 0) )!'/ft <£ ( 0),^ 


(£' u, |W I (r')|£ i - 0) > (14.95) 


In (14.95) w 7 e have allowed explicitly for the possibility that //] depends on time. 
Combining (14.93) and (14.95), we see that through first order 

c f (t) - 5 fi - l - f dt' e iiE T- E i m), ' /h (Ef>\H l (t , )\ E j 0) ) + ■■■ (14.96) 
fi J 0 


EXAMPLE 14.3 The Hamiltonian for a charge q in a one-dimensional 
harmonic oscillator in a classical (not quantized) electric field, w'hich we will 
take here to have the time-dependence |E| = |E 0 |e~ f / T such as would arise if 
the oscillator were situated between the plates of a discharging parallel-plate 
capacitor, is given by 


Choose 


H = + -morx 1 

2 m 2 


-qx |E 0 k f/T 


p\ 


.2 c2 


H () = — 4 —mco x 
2m 2 
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and 


Suppose that at t = 0 the oscillator is in the n = 0 ground state. What is the 
probability that the oscillator is in an excited state at t = oo? 


SOLUTION From (14.96) 


C|i( oo) = fil5o[ f dt’ e incoI 'e- ,,/T (n\x\0) 0 
h Jo 

Expressing the position operator in terms of raising and lowering operators, 
we find 


c„(oo) = 



* 

dt' e int,,r 'e- t ' /x (n\(d +d f )\0) 
dt' e inw, 'e~'' /T (n\\) 


Thus only Cj(oc) is nonzero: 


Cj(oo) = r dt' e^'e-" x 

n V 2m co J o 

_ >?|Kq1 / h t 
h V 2 mco 1 — ia)T 

The probability of the oscillator making the transition from the ground state 
to the first excited state is given by 


|c,(oo)| 2 = 


(glE 0 |r) 2 1 

2 mhcu 1 + oj-t- 


This result is our first hint of how a selection rule might arise: Since the 
position operator in the perturbing electric dipole Hamiltonian H x involves 
a single raising and a single lowering operator, only transitions in which the 
quantum number n of the oscillator changes by 1 are permitted through first 
order in perturbation theory. 


THE SCHRODINGER PICTURE 

Before going on, it is instructive to rephrase our discussion of time-dependent pertur¬ 
bation theory using the interaction picture. First, let’s summarize time development 
in the familiar Schrddinger “picture” that we have used so far in our discussion of 
lime evolution. In this picture, states evolve in time according to 

W s (t)) = U s (tM s m (14.97) 
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where 


U s (t) = e ~ ilh/n (14.98) 

hen the Hamiltonian H is independent of time. In general, as shown in (4.7), the 
time-development operator satisfies the differential equation 

ih^-U s (t) = HU s (l) (14.99) 

at 

Note that we have added a subscript S to the slates and to the time-development 
operator to distinguish them from states and operators in other pictures. According 
to (4.16). the expectation value of an operator O s is given by 

dW 5 U)\O s \*sl‘)) = i {x j, sm u 3 d s W s (t)) (14.100) 

at n 

where we are assuming that the operator O s does not itself depend explicitly on time. 

THE HEISENBERG PICTURE 

An alternative to the Schrodinger picture for describing time evolution is the Heisen¬ 
berg picture. In the Heisenberg picture, it is the states that are constant in time: 

llM0> = ul(tM s (t)) = e'^ t,fl \\j/ s (t)) = liMO)) (14.101) 

Since the time-development operator U s (t), which evolves states forward by time / 
[see (14.97)J. is unitary, namely U^(t)U s (t) = 1. 0j.(r) is the inverse of U s (t) and 
evolves states backward by time t, thereby leaving the state as it was at t = 0. On 
the other hand, an expectation value can be written as 

(iMOlWsW) = <^(0 )\e ifi,ltl d s e- ifl, l n \f s m 

= (^ H \e if " /n 6 s e-‘ lh/,i \ikn) (14.102) 

which suggests that we define the operator 0 H in the Heisenberg picture by 

d H = e ift,lh d s e- nh/h (14.103) 


so that 


<^s(0KWs(f)> - W H {t)\6„{tM„(t)) (14.104) 
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Notice that 


d On 
dt 


de im/R 

dt 


6 s e- iH,/h + e iHt/n 6 s 


de~ iHl / Tl 

dt 


= - (e ilh/n Hd s e- i ” ,/h - e ifi,lfl 6 s He~ iiltlh 


= l -w, 0 H ] 


(14.105) 


■ here in the last step we have taken advantage of the fact that the Hamiltonian com¬ 
mutes with the time-development operator. 7 Thus in the Heisenberg picture the state 
vectors are fixed in time while the operators carry all the time dependence. Although 
we have phrased our whole discussion of nonrelativistic quantum mechanics to this 
point in the Schrodinger picture, the Heisenberg picture is a natural picture for quan¬ 
tum field theory. In fact, we slipped into this picture naturally when determining the 
\ ector potential operator, which in (14.48) varies in time. 


EXAMPLE 14.4 Determine the time evolution of the raising and lowering 
operators for the harmonic oscillator Hamiltonian 


H = —+ — mco 2 x 2 = hco 
2m 2 



in the Heisenberg picture. 

SOLUTION We will take operators without subscripts to be operators in 
the Schrodinger picture. We can use (14.105) to determine how the operators 
evolve in time in the Heisenberg picture. In particular, the lowering operator 
in the Heisenberg picture satisfies 


da H _ i r/r - n 
dt ri 

= -e ilh/h [H, a]e-^ t/n 

h 

= -e if,tin {-h(od)e- ifl,lh = -icoa H 
h 

The solution to this differential equation is 

d H (t)=d H (0)e- iw =de- iw ' 


Note that the Hamiltonian is the same in the two pictures. 
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where the last step follows since the two pictures coincide at / — 0. Similarly, 
we find that 

a),{t) = a^(0)e iM = a f e im ' 

Notice that if we express the Hamiltonian in terms of the operators in the 
Heisenberg picture, the time dependence cancels out as it must because the 
Hamiltonian itself is independent of time: 

H = hat ) + - 

= tko ^<S^(0)<7//(0) + ^ = fut) ^ct'ci + ^ 

Also note that the commutator of the raising and lowering operators in the 
Heisenberg picture is given by 

[a H {t), a]j(t)]= 1 

provided we are careful to evaluate the commutator at equal times. See also 
Problem 14.13. 

THE INTERACTION PICTURE 

We are now ready for an intermediate picture in which both the operators and the 
states carry some of the time dependence. We presume that we can break up the total 
Hamiltonian into two parts in the Schrodinger picture: H — H {) + Hy. We define the 
state in the interaction picture by 

( 14 . 106 ) 

Thus if Hy were to vanish, we would be evolving the state backward in time to its 
value at t = 0, as is the case in the Heisenberg picture. But since we are presuming 
that Hy does not vanish, the state 1^/(0) does vary with time and its time dependence 
in the interaction picture is governed by Hy. 

at at 

= -H 0 e i "° , ' K \xlr s (t)) + e '^ ,/n (H () + HyMs(O) 

= e i6 ° lin H x \ty s (t)) = e^o'^Hye-^o'^Wiit)) (14.107) 

If we examine the expectation value of an operator, we find 

Ws(t)\O s W s (t)) = #/(/)k , "° ,/ ' £ O 5 e-' />0/M l*/<0> (14.108) 
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which suggests that we define an operator in the interaction picture by 

6, = j I *H h 6 s e- iil ° ,/h (14.109) 

Presuming again that the operator O s in the Schrodinger picture does not depend on 
time, the time dependence of the operator O t in the interaction picture is governed by 

d -~ = j {e ifl ^ n H 0 d s e~ , ^ h - 

= kH 0 ,6,] (14.110) 

n 

Thus the time development of operators in the interaction picture is determined by 
H { j. From the definition (14.109) of operators in the interaction picture, we see that 
the Hamiltonian H () is the same in both these pictures. On the other hand, H l and 

H u = (14.11 I) 

differ, since Hq and do not in general commute. Consequently, even if // 0 
and H } are both time independent, H u will, in general, be time dependent. The 
time evolution (14.107) of the states in the interaction picture can be expressed 
conveniently in terms of H u as 

= (14.112) 

dt 

Equations (14.110) and (14.112) are the fundamental equations governing time 
dependence in the interaction picture. In this picture the time evolution of the 
operators is determined by and time evolution of the states is determined by 
H\. The interaction picture is intermediate between the Schrodinger picture, in 
which the states carry all the time dependence, and the Heisenberg picture, in which 
the operators carry the time dependence. It is convenient to express the solution 
to (14.112) in terms of a time-development operator in the interaction picture: 

IV'/(0) = t//(01^/(0)) (14.113) 

Because H u depends on time, the time-development operator in the interaction 
picture is not simply given by e~ lIIu ‘/ fl . However, from (14.112) we see that the 
time-development operator in the interaction picture satisfies the equation 

= (14.114) 

dt 

and thus we can at least write a formal solution in the form 

Vj{t) = \- 1 - f dt'H u (t')U,(t') (14.115) 

h Jo 
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as can be verified by substituting this expression into (14.114) and noting that 
(7/(0) = 1, the appropriate initial condition. We can obtain a perturbative solution 
by iteration: 


U/(t) — I 

= 1 


f fdt'Huit') 

h Jo 

j fdt'HuQ') 
h Jo 


1 - ^ jf dt"H u {t")U,(t ")j 

+ (“) j dt'H u it') J' dt"H u (t”) + • • ■ 


(14.116) 


If, as before, we assume that the perturbation is small and insert a parameter k in H ] 
to keep track of the order of smallness, the expansion (14.116) can be considered 
as a power series in k. We hope that the series converges sufficiently rapidly that 
retaining the first few terms in the series gives a good approximation for the time 
development of the system. 

Finally, let's return to our initial problem of determining the c„(t) in (14.83). 
Comparing (14.106), which defines the stale |tA/(/)) in the interaction picture, 
with (14.83), we see that 


\ih(t)) = e i "°" h W s (t)) 

= e iH o ,,n c„(r)e _ ' £ "°’' /fi |£< 0) ) 
n 

-^c„(r)|£< 0) ) (14.117) 

n 

Thus 

c f it) = <£jV/(0> (14.118) 


If, as in (14.92), we choose the initial state to be an energy eigenstate of H 0 , namely, 
|(MO)) = |£' 0> ). t l 1en using the expansion (14.116) we find 

(£f|(7 / (r)|£f ,) ) = <£f |£, (0) ) - i jf dt'(E^\H u (t')\El 0) ) + ■■■ 

= 8 fi ~ - f dt'iEf\e iA ° i ' ,n H [ e~ ifio, ' ,h \Ef ) ) + ■ • • 
h Jo J 

= S fl - I f dt'e^-^'^EfmE?') + ■. . 
n Jo 1 

(14.119) 
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in agreement with (14.96). Thus the probability of making a transition from the | Ef 1} ) 
to the state \E'f l ) is given by 

\c f (t)e~ iE< f ),/n | 2 = \c f (t)\ 2 = \{Ef \U,(.t)\El 0) )\ 2 (14.120) 


14.6 Fermi’s Golden Rule 


We are ready to examine the time evolution of an excited state of an atom. We take 
the initial state to be an energy eigenstate | i) = \n t> I h ;«, ) <g> |0), where the atom is in 
the state \n h m ( ), with no photons present. The final state is | n f. / j, m f) <S> |1 ), 

where the atom is in the state \n f, lj , m j) and a photon with momentum hk and 
polarization s has been emitted. For example, we might be interested in calculating 
the lifetime for a hydrogen atom in the 2 p state to emit a photon and make a 
transition to the Is ground state. The Hamiltonian H { is given by (14.79). Since the 
total Hamiltonian (14.73) is the Hamiltonian for a closed system, with no external 
sources or sinks of energy, it must be independent of time. Thus we can use the 
expansion (14.48) for the vector potential evaluated at a particular time, say t = 0, 


2nfi 


to express the Hamiltonian in terms of the annihilation and creation operators for 
photons. The amplitude to find the system in the state | n f,lj-, mj) <g> | l ki ) at time 
t is given by 




^ k .i e (k, S) -J= +« k 5 e(k. s )• 


(k*r 


Tv J 


(14.121) 


A(r) = V c 


<1k,J ® nif\U,{t)\n n /,-, rrij) ® |0) 


(14.122) 


or more simply 


<f\Ui(t)\i) (14.123) 

where it is understood that the initial and find states are eigenstates of H 0 . 

Since the in (14.79) is independent of time in the Schrodinger picture, eval¬ 
uation of the time integral in (14.119) is straightforward. Defining 

n = (E ( p - Ef^/h (14.124) 


we obtain 


f cit’e^ 1 ' = -{e^ - 1) = 

J o if] 


Jm/ 2 

-sin(p//2) 

(h/2) 


(14.125) 
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•» 


Thus the probability of making a transition is given by 


K/|tf/(0|i>l 2 = 


1 sin 2 (pr/2) 

K 2 (h/2) 2 


l(Ml*>l 2 


(14.126) 


Figure 14.4 shows 


sin 2 (pf/2) 

in/ 2) 2 


(14.127) 


plotted as a function of rj. There is clearly a nonzero probability of making transitions 
to states with energy such that p ^ 0, that is, to states such that £^ 0) / Ef ] . For an 
atom in an excited state making a transition to another state with the emission of a 
photon, this means 


E nf + fio}yL E n . 


(14.128) 


where the energies E n . and E Uf are the initial and final energies, respectively, of the 
atom. (For the hydrogen atom, E„ — —/ic 2 a 2 /2rr.) 

How then does conservation of energy arise? Notice that the first zero of (14.127) 
occurs when tij/2 — : r, or p = 27r/f. Also notice that (sin 2 pt/2)/(p/2) 2 t 2 as 
p 0. Thus as t increases, the function (14.127) becomes narrower and higher, 
and the probability of making a transition to a state that doesn't conserve energy 
decreases. In fact, for large t this sounds like a Dirac delta function, except that the 
area of the central peak, which is roughly the height times the width, is growing 
like t. Indeed, we can make use of the representation of a Dirac delta function 


lim — 

r->-oo n 


sin 2 (fp/2) 

f(p/2) 2 


= 6 



(14.129) 
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to express the transition probability as 

lim l(/|f/ / (r)|/)| 2 = 7r/ y 2) l(/ltf,l01 2 (14.130) 

l-+oo ffl 

Thus in the large-/ limit we see energy conservation appearing. For finite times, 
we should not expect strict energy conservation in any case. After all, the energy¬ 
time uncertainty relation AEAt > h/2 implies that if the evolutionary lime A/ for 
the system is finite, the energy of the system is uncertain by AE > h/2 At. In our 
example, the time t is such an evolutionary time. Thus, as (14.126) shows, we should 
expect to find transitions to states with a spread in energy A E, where A Et ~ h. In 
practice, the evolutionary time / imposed by the experimental setup makes the large¬ 
time limit the appropriate one. For example, detecting nonconservation of energy by 
one part in 10 6 for photons emitted in a transition in the visible part of the spectrum 
would require that the time / be on the order of 10“ 9 seconds. In a typical experiment 
in which atoms are excited in a discharge tube, we do not know when the atom was 
actually excited to this precision, let alone interrupting the time evolution of the 
system within this time period. Such interruption can occur naturally, however, such 
as when the atom is dc-excited by colliding with other atoms in the discharge tube. 
Although, in principle, such collisions could shorten the “natural” evolutionary time 
for the system, leading to a spread in energy A E in the photons emitted (collision 
broadening), the natural time scale for such collisions is actually large in comparison 
with the natural lifetime. 8 

Neglecting effects such as those due to collisions, we can safely calculate the 
transition probability in the large-time limit using (14.130). At second glance, the 
appearance of the energy-conserving Dirac delta function in this equation may 
now seem disturbing. After all, when t] = 0, the probability of making a transition 
appears to be infinite. However, the transition probability (14.130) is not physically 
significant. It is not possible to observe a transition to aparticular final state involving 
a photon with a particular k. Any detector that we use to detect photons counts 
photons within a range of angles, which are determined by the solid angle subtended 
by the detector. Also, there is always some energy resolution for the detector; 
it detects photons within a range of energies. In order to compare (14.130) with 
experiment, we need to calculate 

k+Ak 

£ \{f\Ui(W )\ 2 (14.131) 

k 


s The natural lifetime of the excited state sets an evolutionary time for the system that is on 
the order of 10 -9 seconds, as we w ill calculate in the next section, and therefore from the energy- 
time uncertainty relation we should expect to see a spread in the energy of the emitted photons 
that is on the order of 1 part in 10 6 . 
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where the range of Ak included in the sum over final photon states is determined by 
the resolution of the detector. 


THE DENSITY OF STATES 

How many photon states are there between k and k + Ak? Recall that we are working 
in a cubic box of volume V = L 3 and that the allowed photon states are discrete, as 
. indicated by (14.34). In order to count the density of states, we set up a lattice with 
each state represented by a point in the lattice with the coordinates (n x , n Y , «,), as 
shown in Fig. 14.5. Each point on the lattice labels a photon state. 9 Since the radius 
in the lattice is given by 


r = v /' , i + "5 + "l 

=hf k F+*i+^=£* <i4i32) 


and there is one state per unit volume in this lattice, the number of states between r 
and r + A r is given by the volume of the spherical shell between these two radii. 


Anr 2 Ar = An 



(14.133) 


assuming A r <3C r. Notice that each positive and negative integer on the lattice 
corresponds to a different photon state, and therefore we have to count the number of 
states in the whole spherical shell between r and r + Ar. The fraction of the states 
in a solid angle AS2 is just given by 


— \ k 2 AkA& (14.134) 

2n) 

As V becomes large, the spacing between photon states becomes small, ap¬ 
proaching a continuum as V -*■ oo. In this case, the number of states between k 
and k + dk and between Q and £2 + dQ is given by 10 


I V 

— k 2 dk dQ = -- d 2 k (14.135) 

In) (2n) 3 


“ Strictly, there are two photon states for each value of k. taking into account photon polariza- 
tion. To get the total transition rale at the end we must sum over these different polarizations. 

Incidentally, if we make the substitution p = fik, we see that the number of states between 
p :!id p -t- </p is given by 


Vd 3 p _ Vd } p 
(2 n) 3 h? h 3 

-r a mg that each stale occupies a volume h 3 in phase space. 
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Figure 14.5 Each allowed value of k corre¬ 
sponds io a point in the (n x , n v . /;.) lattice. For 
the sake of clarity, only a few of the points are 
shown. The number of points between k and 
k + A k is given by the volume of the spherical 
shell between r and r + Ar. 


Consequently, the sum in (14.131) is replaced by an integral: 

£ \(f\U,m)\ 2 = / -T-TaK/ltf/WI/)! 2 (14.136) 

k J (2jt ) j 

where the limits of the integral are determined by the resolution of the detector. Using 
E = haj = tike, we can write the number of states (14.135) as 

—-~dQdE = pdQdE (14.137) 

(2 7T) 3 He 3 

where w'e have labeled by p dQ the density of states with photon energy between E 
and E + dE. Notice that p dQ is proportional to or. that is. the (angular) frequency 
squared. Incidentally, our notation is somewhat unconventional because p dQ is 
often just called p for historical reasons. 

THE TRANSITION RATE 

We are now ready to determine the transition probability to a set of photon states 
that arc covered by the detector: 

hm £ |(/|U/(0|/)| 2 = J dE J dQ nt6 ^J 2) \(f\H i \i)\ 2 p (14.138) 

where the angular and energy limits of the integration are determined by the resolu¬ 
tion of the detector. Since 

J (i)= s ( 

= 2hS[E - (£„, - E„ f )] (14.139) 

the transition probability per unit time into a solid angle dQ that we obtain by 
carrying out the energy integration in (14.138) and dividing the probability of making 


"■’iif 4- E E n . 
2h 
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- transition by the total time t is simply given by 

dR = ^-\(fmi)\ 2 pda (14.140) 

h 

Integrating over all solid angles and summing over the two polarization states of the 
photon, we obtain for the total transition probability per unit time to a particular final 
>iate of the atom 

R = T,Jy\( f\Hi\i)\ 2 P d n (14.141) 

The result (14.141) is often referred to as Fermi’s Golden Rule. Note that R, 
the transition probability per unit time, is independent of time. Since probability of 
decay in time dt is given by R dt, the probability of the atom decaying in the next 
time interval dt does not increase the longer the atom survives. For a sample of N 
atoms excited at l = 0, the number dN that decay in time dr is given by 

dN = —NR dt (14.142) 

which can be integrated to yield 

N{t) = N(Q)e~ R ' 

= N(0)e~ l/T (14.143) 

Thus the lifetime r for this decay is given by z = 1/i?. 11 

14.7 Spontaneous Emission 


In order to determine the lifetime for spontaneous emission of an excited state of 
the hydrogen atom, we need to calculate the matrix element in (14.141). We use the 
interaction Hamiltonian 

f' 2 

H] —> —A ■ -V H--—-A 2 (14.144) 

m e c i 2 m e c 2 

In evaluating the matrix element 

<l k .,l ® m ,) <g) |0) (14.145) 

the photon part is the easiest. The only term in the expansion (14.121) for the vector 
potential in terms of annihilation and creation operators that contributes to this matrix 
element is when the particular s that creates the final-state photon acts on the 


Fortunately, more complicated systems such as human beings, who can show signs of aging, 
r.ccd not conform to this behavior. 
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photon vacuum state |0). Note that the term in the Hamiltonian involving A 2 cannot 
contribute to this matrix element, since it involves terms such as a k s a k , „ which 
changes the number of photons in the initial state by two. Here we are calculating 
the amplitude for emission of a single photon. The reason that two-photon emission 
is less likely than single-photon emission is that the rate for single-photon emission 
is proportional to e 1 (from the square of the matrix element), while the rate for two- 
photon emission is proportional to e 4 . We can use h and c to express these factors 
in terms of the fine-structure constant a = e 2 /hc = 1/137. Thus the rate for single¬ 
photon emission is on the order of a, while the two-photon emission rate is on the 
order of a 2 . 

Using the interaction Hamiltonian (14.144). we find 


(lk.il ® («/, If, tn f \fi]\ni, l h mj) ® |0) 



d 3 r 

Yn f’ 




e ,k r £(k, s) • yVt/r n .,. m . 


(dk^ClO)) 


(14.146) 


Note that 


<lk,il^|0)=(l M |l M > = l (14.147) 

Interestingly, for emission in the presence of the photons, each with momen¬ 
tum hk and polarization s, we would have 

((« + l)k,*J^k t j l w k,i) = yj n k,s + 1 ((” + l)k.jl(n + l)k,i) =-J l1 k,s + 1 (14.148) 

Thus the transition rate for stimulated emission is a factor of n k s + 1 larger than for 
spontaneous emission. This is the key to the operation of the laser. A resonant cavity 
is set up that traps photons with a particular k, say between two reflecting surfaces, 
as shown in Fig. 14.6. Then subsequent decays of excited atoms are more likely to 
be into states wi th a particular type of photon if there are already photons of this type 
present. 



Mirror Partly 

transparent 

mirror 


Figure 14.6 A resonant cavity formed by 
two reflecting surfaces traps photons with 
a particular wave vector k. These photons 
then stimulate emission of other atoms in 
the cavity that have been pumped into the 
excited state. 
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THE ELECTRIC DIPOLE APPROXIMATION 

How large is the argument k • r of the exponential in (14.146)? The size of r is on 
the order of the size of the atom, that is, on the order of a Q , the Bohr radius. The 
wavelength X for transitions in the visible is 4000-7000 A, w'hilc even for Lyman 
a. the n = 2 to n — 1 transition, the wavelength is on the order of 1200 A. Since 
k - litIX, k r « 1. and it is a good approximation to use the series expansion 

e~ fkr = 1 - /k ■ r + - r) ~ + • • • (14.149a) 

2! 

and replace the exponential with the first term: 

e~' kr _» 1 (14.149b) 

This straightforward mathematical approximation leads to the electric dipole approx¬ 
imation, in w'hich the effective interaction Hamiltonian in position space responsible 
for the transition becomes 


Hi -» —fi e ■ E 


(14.150) 


To establish this result, it is convenient to use a small trick. For the Hamiltonian 


~2 ■> 

h 0 =V--!L 

2n |f| 

the commutator of the Ha mi ltonian and the position operator is given by 


(14.151) 


[H 0 , T i ]= —[p 2 , X;] 

2II 

= J- T\PjPj> ( pAPj ’ + tPy XilPj) 

^ j fX j 


= --J2 Pj iti8 U = -—Pi 


(14.152) 


A' 


A 


Thus 


(n f , l f , m f\pi\njlj, m t ) = y (n f ,l f , m f \[H 0 , m,-) 


= y (E„ f - (n f , If, rriflXilrii, m,) 


or 


— —ipco(n f, If, mj\x i \n i , m ; ) (14.153) 

(nf, lf, m y|p|n ; , l h m,-) = -ifia)(rif, If, m y|r \n h l h m.,) (14.154) 
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where hto — E n . - E n is the energy carried by the photon. Using this result, we see 
that the matrix element of the interaction Hamiltonian taken between the initial atom 
state and the final atom-photon state is the same as that generated by the replacement 

e * h —e 34 

-A ■ —V -»■- - r = er E = ~n e E (14.155) 

m e c i c 3 1 


in the limit that we make the replacement of the exponential (14.149) by one. 12 
The transition rate (14.140) can now be expressed as 


2n ( jlrth 

dR ~ — 

h 


(/If) eV 1/ * ■ s(k ’ *> 

„ * 

aa> 3 I C q 

= 2nc* I J d Y ^f,l f , mf r ' «(k, s) 


2 Va? dQ 


(2 n) 3 c 3 ti 

* 

dtt (14.156) 

where in the last step we have introduced the fine-structure constant a. 


THE LIFETIME OF THE 2p STATE OF HYDROGEN 

Our problem now reduces to evaluating the matrix element of the position operator 
between the initial and final atom states. Note that the components of the position 
vector r can be put in the following form: 


~^ (X + iy) = "75 rei * sin 0 = V f rF u 

~(x — iy) = —(= re sin 9=.—rY i _i 
V2 sfl V 3 ’ 


z — r cos 6 — 



In terms of these components 


(14.157a) 

(14.157b) 

(14.157c) 



12 In calculating the time derivative in (14.155), we can take the time dependence as given 
by (14.48), since the operators in the interaction picture evolve according to Hq. We then evaluate 
the whole Hamiltonian at / = 0 to determine the form of H[. Note the radius vector r points from 
the proton to the electron and thus the electric dipole moment of the electron-proton system is 
given by ti e = — er. Also, we have replaced the reduced mass in (14.154) by the electron mass. 
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Thus 


/ d * r 'K h l t , mf r ■ e(k ’ 

/ , fkn fe x + ie x e x — ie x \ 

* r V/W t ~ -^-‘•a.+^AoJ 

(14.159) 

In order to calculate the lifetime of the 2 p state of the hydrogen atom, we must 
evaluate the matrix element 


f d * r K o.o r ' £ ( k - 

= / d3 >- Wo 'Vy (-yy^ K '.-i 


V2 


'^1,1 + e z Y \ 


0^ ^2,1^1, 


>»l 

(14.160) 


Note that y 0 0 = \/\/An is a constant, while from (14.157) we see that T, , = — T*_,. 
Thus in carrying out the angular integrals in (14.160), we can take advantage of the 
orthogonality relation 


/ 


W m Y Um =8 m , mt 


(14.161) 


to express (14.160) as 


/ d3 rr iAQ r-e(k,s)+ 2Xmi 

/T / e x + is \ e x~ ie v \ 9 

= V5 (“yr^i + + l drr 2 R; Q rR 2A 


(14.162) 


Notice that if, for example, the atom is initially in a state with m , = 1, then the 
only nonvanishing piece of the matrix element contributes a term proportional to 
e x + is y . If the photon is propagating in the z direction, this corresponds to a right- 
circularly polarized photon state, as discussed in Sections 2.7 and 14.3. Thus the 
angular momentum of the initial state is carried off in the intrinsic spin of the emitted 
photon in this particular case. In general, a photon emitted in some other direction 
carries the angular momentum of the atom in a combination of both orbital and spin 
angular momentum. 

Using the radial wave functions (10.43) and (10.44b), we find 



dr r 2 R* 0 rR 2J = 



(14.163) 
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Thus the absolute square of (14.162) is given by 


l 

3 


I „2 , -2 
E + S 
x y 


°m i . 1 


el + Sy 
+ JL r^S„ 


+ 


m, .0 


)I5 


3 9 


(14.164) 


We can determine the transition rate separately for each of the three possible in,-. If, 
as is commonly the case, the atom is initially unpolarized and thus each of the three 
nij values occur with equal probability, we can obtain the transition rate by averaging 
over these different values of 


5 ?|/ 


dr r i0 0 r • g ( k - s)fl,u 


2' 5 
^To u o 


al-is] + s 2 y + s 2 ) 


2 15 2 
3" fl ° 


(14.165) 


The transition rate is independent of the direction of e when we average over the 
initial m values. The total transition rate is obtained by integrating over all possible 
directions in which the photon is emitted and summing over the two possible 
polarization states for each photon: 


R 


2p —►!.? 



cz m 3 2 15 2 
2^3^ 0 = 


aa? 2 17 9 
“^~3^ 


(14.166) 


Since 


hco — E 2p 


~ Els = 


1 1 2 
-m r c~a 
2 



3 2 2 

= pn e c a 


(14.167) 


we can express the transition rate in the form 


Elp-yls ~ 



= 0.6 x 10 9 s -1 


(14.168) 


The lifetime for the transmission is therefore 


x 2 P ^is — ~—•— = 1-6 x 10 9 s 


R-> 


2p-> U 


(14.169) 


Our ability to calculate this lifetime from first principles is one of the triumphs 
of quantum mechanics. Not only can quantum mechanics make detailed predictions 
about the energy levels of the hydrogen atom, it can also predict the rates for the 
transitions between these levels as well as the angular distributions of the photon 
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emitted in the decay. 13 In fact, using quantum mechanics, we are able to predict the 
results of any measurement that the experimentalist can make on the hydrogen atom. 


MAGNETIC DIPOLE AND ELECTRIC QUADRUPOLE TRANSITIONS 

A transition in which the matrix element of the electric dipole Hamiltonian between 
the initial and final states is nonzero is known as an electric dipole transition. 
However, if we evaluate the probability of a 3 d state of hydrogen making a transition 
to the Is ground state using this Hamiltonian, we find that the matrix element 
vanishes. We can see the reason by evaluating the angular part of the matrix element 
f d i r ti!t *0 0 r • e(k, .v)V r 3 , 2 I m / > which involves integrals of the form 

j di 2 Y* 0 Y lm Y 2 , m . = -±=JdG Y lm V 2 , m . (14.170) 

which vanishes since 

0 < 141711 

Although an electric dipole transition from the 3 d state to the 2 p state is allowed, and 
a subsequent transition to the ground state through a second electric dipole transition 
is also permitted, (14.171) shows that a direct electric dipole transition between the 
3 d state and the ground state is not possible. 14 This raises the question: Is any direc: 
transition with the emission of a single photon from the 3 d state to the Is state- 
allowed? 

In fact, the Hamiltonian (14.144) contains not just the electric dipole Hamilto¬ 
nian but higher multipole contributions as well. To see how such contributions arise, 
consider retaining the next term in the series expansion (14.149a) of the exponen¬ 
tial e -,k ' r in evaluating the matrix element. This leads to —/k • re • (fiW/i) sand¬ 
wiched between the initial and final atom wave functions, instead of just e ■ (h V 1 1 
in (14.146). To see the physical significance of this term, it is helpful to express the 
operator version as 


—tk re • p = -Hk re • p + e fk p) — Hk re • p — e rk p) (14.172' 


13 The agreement between (14.168) and experiment is. of course, excellent. 

14 In the general case of a transition from the state In,, /,-. m,) to the state |n y, If, m f), the 
matrix element (14.159) is proportional to J dQ }'* m Y l m Y t m .. The product of the two Y ; ,. - 

can be expressed as Y\ <m Y lhm . = £/.,« c l,mYl,m< where L = /, + 1. /,-, /,■ — I, assuming I, # <> 
After all, the Y/ m 's do form a complete set, and the values of L are determined by the additi a 
of angular momenta, as discussed in Appendix B. From the orthogonality of the Y/ m 's. this rest, t 
shows that/y = Z/ + 1. /,. /, — I. However, under parity Y / m -*■ (—I) ! Yf m (see Problem 9.15). anc 
thus for /dQ Y* ,„ / Y l m Y l . m . to be an even function (so that the integral is nonzero), l f canm 
equal I,. and therefore A I =1 1 — /,• = ±1. 
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The second term on the right-hand side of (14.172) can be rewritten as 

i A A A i „ 

— -(k • re p — e ■ rk • p) = — -(k x e ■ r x p) (14.173) 

Vs t* immediately recognize the orbital angular momentum operator L = r xp. Thus 
if itie electric dipole matrix element vanishes, following the type of argument that 
w e used to obtain the electric dipole Hamiltonian in (14.155), we find that 

p a h p » 

-A • —V ->■-B L (14.174) 

m e c i 2 m e c 

vs hich is of the form 

#! = -£ B * (14.175) 

where 

fL = ^-t (14.176) 

2 rn e c 

This expression for the magnetic moment of a particle with charge —e is the familiar 
result with which we started our discussion of the interaction of a magnetic moment 
in an external magnetic field in Chapter 1. See (1.2). The Hamiltonian (14.175) 
eon tributes to what are called magnetic dipole transitions. Since we have included 
the second term in the expansion for the exponential, we expect the size of the matrix 
element to be of order ka 0 smaller than for an electric dipole transition and the 
ransition rale to be of order (ka 0 ) 2 smaller. This suggests that the atomic transition 
ate for a magnetic dipole transition should be on the order of 10 6 times smaller than 
or electric dipole transitions, and the corresponding lifetime should be on the order 
jf 10 6 times longer. 

So far in our discussion of magnetic dipole transitions, we have neglected the 
xirt of the magnetic dipole Hamiltonian that depends on the intrinsic spin of the 
ilectron: 

-^-S • B (14.177) 

2 m e c 

n a nonrelativistic approach to the quantum mechanics of the electron, we must still 
nit this term into the Hamiltonian by hand. In a fully relativistic approach in which 
he electron is treated on the same basis as the photon and the Dirac relativistic "‘wave 
unction” becomes a quantum field operator, the intrinsic spin part of the interaction 
nters naturally. In any case, we can now see why we neglected (14.177) in analyzing 
n allowed electric dipole transition. The spin magnetic dipole Hamiltonian (14.177) 
vould make a contribution of the same size as the magnetic dipole Hamiltonian 
14 174), which is due to orbital angular momentum. 

Actually, you can show that even the magnetic dipole Hamiltonian cannot con- 
nbute to a direct transition from the 3d to the Is state. However, so far we have 
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neglected entirely the “symmetric” portion of the operator (14.172), that is, the first 
term on the right-hand side. This term leads to an electric quadrupole Hamiltonian 
that does connect the 3 d and l.v states (see Problem 14.18). Thus the transition rate 
to go directly between these two states with the emission of a single photon should 
be roughly a million times smaller than for a typical electric dipole transition. 

14.8 Cavity Quantum Electrodynamics 


One of the major consequences of expressing the electromagnetic field as a quantum 
operator is that it gives us a mechanism to understand phenomena like spontaneous 
emission. Otherwise, all the energy eigenstates of an atom would be stationary 
states. In our discussion so far, spontaneous emission appears to be an unavoidable 
consequence of the coupling between the matter and the vacuum. Surprisingly, it is 
possible to alter radically these vacuum states by enclosing the atom in a cavity. In 
fact, the transition rate of an excited stale to a lower energy stale can be completely 
suppressed. 

For a rectangular cavity, for example, with sides L x = L y — L and L z = d 
with perfectly conducting walls the allowed modes of the electromagnetic field are 
given by 


where 



A x = A 0[ cos k x x sin k y y sin k z z e “° l 
A y — A 0v sin k x x cos k y y sin k.z e~'"" 

A z = A 0 . sin k x x sin k y y cos k z z e~'" ,r (14.178) 

= ^ = ^ n x , n y , n z = 0, ±1, ±2,... (14.179) 


These modes satisfy the boundaiy condition that the tangential component of the 
electric field vanishes at the walls and take the place of the specific plane wave 
solution A = A 0 e' (k ' r_w,) that we used in free space. Within the cavity, the equation 
of motion for the vector potential is still given by the wave equation (14.30) and 
therefore the allowed frequencies that follow from the conditions (14.179) are 


= C \j k l + k ) + k I = C7T V 73 


L 2 d 2 


(14.180) 


The Coulomb gauge condition V • A = 0 also requires 

k x A x + kyAy A k z A. = — (n x A x + n y A y ^ 4 —- (/i 2 A^)=0 (14.181) 

Thus there are two independent polarizations, unless one of the integers n x , n y , and 
n. is zero, in which case (14.181) shows only one independent polarization exists. 
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Figure 14.7 Two large parallel conducting plates separated by 
a distance d can be viewed as a cavity lor transmitting atoms 
traveling along, say. the x axis. This is the same geometry as for 
the rectangular cavity whose modes are given in (14.180) except 
that four of the six walls of the rectangular cavity are absent. It 
is common, nonetheless, to refer to this configuration of parallel 
conducting plates as a cavity. 


Notice there all three of the integers cannot be zero, for then A vanishes, according 
to (14.178). 

Now if the dimensions of the cavity are sufficiently small, it is possible that all 
the energies hco n „ may be greater than the energy difference between an excited 
state and a lower energy state of the atom and consequently the atom can no longer 
decay into this lower energy state. On the other hand, it is also possible that one of 
the allowed frequencies of the cavity is resonant with the transition frequency, in 
which case the spontaneous emission rate is enhanced. 

A particularly nice example of inhibited spontaneous emission is the propagation 
of an atom through a cavity consisting of parallel conducting plates, as illustrated in 
Fig. 14.7. The experimental challenge is to find an atom with a transition wavelength 
long enough to permit construction of a practical cavity but one whose lifetime for 
spontaneous emission is sufficiently short that the atoms radiate while traversing the 
cavity, at least in the absence of any inhibition due to the cavity itself. Hulet, Hilfer, 
and Kleppner used an atomic beam of cesium atoms that had been prepared in the 
state n = 22, / = 21, m■ = 21. The only allowed electric dipole transition for such 
a large-n atom (commonly called a Rydberg atom) is to the state n = 21, I — 20, 
m = 20. For such large n, the spacing between adjacent energy levels is quite small, 
corresponding to a wavelength for the photon emitted in this transition of 0.45 mm. 
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Figure 14.8 The transmission of n = 22 cesium atoms through a cavity formed by 
two parallel conducting plates as a function of k/2d. where k is the wavelength of the 
n = 22, / = 21 to n = 21, / = 20 transition. The wavelength is altered by application of an 
electric field, which increases from 0 to 3.1 kV/cm for the data shown. The sharp increase 
in transmission at k/2d = 1 is due to the inhibition of spontaneous emission for k>2d. 
The decrease in transmission for k/2d > 1.015 is due to field ionization between the plates. 
Adapted from R. G. Hulct. E. S. Hilfer. and D. Kleppner. Phys. Rev. Lett. 55, 2137 (1985). 


The mode density with the electric field parallel to the surface of the conducting 
plates (the mode required for this transition) vanishes for d < k/ 2. 15 At first the 
width of the gap between the plates was set at 230 /am, slightly more than one- 
half the photon wavelength. Next the energy levels were shifted closer together by 
a few percent by an applied electric field (via the Stark effect) in order to increase 
the transition wavelength beyond the k/2 cutoff, leading to a large increase in the 
transmission of n = 22 atoms that do not decay in passage through the cavity, as 
shown in Fig. 14.8. Hulet et al. estimate that this corresponds to an increase in the 
natural lifetime by a factor of at least 20. 

THE CASIMIR EFFECT 

Another interesting consequence of the different mode structure of the electromag¬ 
netic field between two parallel conducting plates is the Casimir effect, which shows 
the effect of the zero-point energy that we discussed briefly in Section 14.3. 


15 See the article by E. A. Hinds in Cavin' Quantum Electrodynamics, P. R. Berman, ed.. 
Academic Press, San Diego, CA, 1994. p. 19. 
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For the region between the plates (see Fig. 14.7), the zero-point energy is given by 



H hc \ 


ir 2 n 2 z 


k,s 


k ' +k * + * 


(14.182) 


The allowed modes are ones in which an integral number of half wavelengths fit 
in the separation d between the plates. Thus there is a different mode structure in 
the region between the plates than occurs in free space. We can define a zero-point 
potential energy function as the work necessary to bring the plates together from 
infinity, namely 


V(d) = £„(</) - E 0 (oc) (14.183) 

Although E 0 (d) and £ 0 (oo) both diverge in the high-frequency (short-wavelength) 
limit, no material can behave as an ideal conductor at all frequencies. Thus it makes 
sense to impose a cutoff on these zero-point energy sums for short wavelengths. But 
the energy difference V(d), which arises from the difference in the mode structure 
between the plates and the mode structure in free space at the longer wavelengths 
(on the order of the spacing between the plates), turns out to be finite, independent 
of the cutoff, and has the value 


hcL 2 7T~ 

V(d) = - ■ (14.184) 

720</ 3 

Thus the pressure P (the force per unit area) attracting the plates is the negative 
gradient of V (d) divided by the area L 1 of the plates, namely 


P 


hen 2 
24 (W 4 


(14.185) 


Therefore, as indicated in Fig. 14.9. two uncharged conducting plates can be viewed 
as attracting each other by virtue of the vacuum fluctuations due to the electro¬ 
magnetic field. 16 For plates with a separation of 1 cm, this pressure is small, only 
1.3 x 10 -18 dyne/cm 2 . But if the separation is 1 micron, this pressure is 10 16 times 
larger. Hence in the world of micromachines, the Casimir force can be dominant. 


16 The calculation is too involved for us to carry out here, but you can get everything except the 
factor of .t 2 /240 from dimensional analysis. Sec Problem 14.19. The calculation was first done by 
Casimir in 1948. See P. Milonni. The Quantum Vacuum, Academic Press. 1994 for details. Milonni 
also discusses an alternative to the zero-point energy derivation. For a precise measurement of the 
Casimir force, see S. K. Lamoreaux. Phys. Rev. Lett. 78, 5 (1997) and U. Mohideen and A. Roy, 
Phys. Rev. Lett. 81,4549 (1998). A good starting point for additional reading on the Casimir effect 
is S. K. Lamoreaux, ,4m. ./. Phys. 67, 10 (1999). The Casimir effect is often considered to be 
direct evidence for the zero-point energy of the electromagnetic field. This is an issue of some 
importance, since the gravitational effects of the zero-point energy should make a contribution 
to the cosmological constant, a contribution that is seemingly much larger than the cosmological 
constant's observed value. 
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Figure 14.9 One can think of the force of attrac¬ 
tion between two uncharged conducting plates (the 
Casimir effect) as arising from a different mode struc¬ 
ture in the region between the plates relative to out¬ 
side the plates. In particular, those modes for which 
half the wavelength is greater than the separation 
between the plates would not satisfy the boundary 
conditions in the region between the plates. 


14.9 Higher Order Processes and Feynman Diagrams 


Using the procedures outlined in the preceding sections, you can calculate the 
transition rate for any first-order process involving the interactions of photons and 
atoms. In particular, it is straightforward to calculate the differential cross section 
for the photoelectric effect in hydrogen, where a photon ionizes the atom, kicking 
out a “free” electron. For the photoelectric effect, the final-state wave function for a 
sufficiently energetic electron can be taken to be a momentum eigenfunction, instead 
of the bound-state wave functions that we used in determining the lifetime of the 2 p 
state (see Problem 14.17). 

It is interesting to consider, at least conceptually, how you would calculate the 
cross section for a higher order process such as photon-atom scattering. In this case, 
we arc interested in the matrix element 

- <l k/>J/ | ® {n f l f , m f \H\\n h Z ; , m,) <g> |l u . t . v .) (14.186) 

Now the 

t—^-A 2 (14.187) 

2m e c A 

part of H ] can contribute to lowest order. The operator A 2 contains the appropriate 
photon operators s a k s to annihilate the initial photon and create the appropri- 
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ate final photon. Thus (14.187) can contribute to photon-atom scattering in first order 
(the operator acting once) in the expansion (14.116) for the time-development 
operator. 

There is another higher order pathway that contributes to the amplitude for 
photon-atom scattering as well. Consider the second-order contribution to the tran¬ 
sition amplitude in the expansion (14.116) 

(") f'dt' I' dt"(f\H u (t')H u (t")\i) (14.188) 

There is a nonzero contribution to (14.188) arising from the other part of the inter¬ 
action Hamiltonian. 

1 

—A--V (14.189) 

m e c i 

In practice, we evaluate (14.188) by inserting a complete set of energy eigenstates 

|/> of H 0 : 


( _ i) l dt ’L </f "E</ | " 1/ (r ' )|/) < / i^' / (r " )|/) (,4 - 190 > 

Notice that the operator (14.189) acts twice, once at t' and once at t”. Also note that 
t' > t". 

There are two different ways in which the operator (14.189) can contribute to 
the photon-atom scattering amplitude given in (14.190). One possibility is for the 
operator (14.189) to act at t" to annihilate the incident photon, with the atom making 
a transition to some intermediate state |/), and then the operator acts again at t' 
to create the final photon. We denote this transition amplitude graphically by the 
diagram in Fig. 14.10a. It is also possible for the operator (14.189) to act at l" to 
create the final photon, and then w hen the operator acts again at t' to annihilate the 
incident photon. This amplitude is represented graphically by Fig. 14.10b. It might 
seem strange that the final-state ph. ion can be emitted before the incident photon is 
absorbed. In particular, this means that for t " <t < t' both photons exist, as indicated 
in Fig. 14.10b. However, these intermediate states in Fig. 14.10 need not conserve 
energy; energy is conserved only when you wait for a long period of lime, as the 
experimentalist does when observing the incident and scattered photons. 

The graphical pictures shown in Fig. 14.10 for the quantum mechanical transi¬ 
tion amplitudes are known as Feynman diagrams. The diagrams are a convenient 
way of keeping track of the terms in the perturbative expansion (14.119) for the 
transition amplitude. Once you see how the analysis works, what the rules relating 
the amplitude to the diagram are. you can learn to write these amplitudes simply by 
constructing the possible diagrams that can contribute to a particular process. The 
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t 


(a) 




Figure 14.10 Time-ordered diagrams for calculating the amplitude for photon-atom 
scattering. Tn (a) the incident photon is absorbed first with a transition to an intermediate 
atom state |/}, and then at a later time the final-state photon is emitted. In (b) the final- 
state photon is emitted first with a transition to an intermediate state consisting of the 
atom and the incident photon and the final photon, and then at a later lime the incident 
photon is absorbed. In (c) the first-order amplitude is due to the A 2 term in the interaction 
Hamiltonian in which both the incident photon is absorbed and the final-state photon is 
created at the same time. This latter diagram, when turned sideways, is often referred to 
as a seagull diagram. 



diagrams are just a shorthand device that provide a powerful tool for calculating 
these amplitudes. 

The Feynman diagrams that arise in a full treatment within quantum electrody¬ 
namics (QED) of photon-electron scattering are actually easier to evaluate than those 
for photon-atom scattering, basically because the electron is not as complicated as 
the atom with all its bound states. In QED you treat the held that creates and annihi¬ 
lates electrons on the same footing as the vector potential that creates and annihilates 
photons. QED is the best theory of any kind in its agreement between theory and 
experiment. With it, quantities such as the g factor of the electron have been deter¬ 
mined to better than nine significant figures. Feynman has noted that if the distance 
between Los Angeles and New York were measured to this precision, it would be 
accurate to the thickness of a human hair. It is this sort of agreement, and the lack of 
any significant disagreement between theory and experiment, that has caused Feyn¬ 
man to describe quantum electrodynamics as the “jewel of physics—our proudest 
possession.” 17 


17 R. P. Feynman, QED: The Strange Theory of Light and Matter, Princeton University 
Press, Princeton, NJ, 1985. Although this book is intended for someone with no familiarity with 
quantum mechanics, it is nonetheless an excellent place to start your reading about quantum 
electrodynamics. 
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Problems 


14.1. By taking the dot product of Ampere's law. (14.21), with E and using the vector 
identity 


V • (E x B) = B • (V x E) — E • (V x B) 


show that energy conservation follows from Maxwell's equations in the form 

^ + V • S,> = —j - E 

where 

* 

h = — (E : + B : ) and S P = -(ExB) 

8tt 4tt 

Discuss the physical significance of each term in this equation. Suggestion: Integrate 
the equation over an arbitrary volume and use Gauss’s theorem to make the physical 
significance more transparent. 


14.2. Show that 



given periodic boundary conditions. See (14.34). 


14.3. The nonrelativistic wave equation is the Schrodinger equation 



s(* v ) * + 


For a relativistic free particle, for which £- — p 2 c 2 + »rc 4 , a natural wave equa¬ 
tion is 


or 


ih^- j xfr + m 2 c 4 i// 


vV - 


1 d 2 \fr 

c 2 a / 2 



T/f =0 


which is called the Klein-Gordon equation. Use this equation to show that there 
is a local conservation law of the form 

^ + V-j = 0 with j=— W*Vf-fV\lr*) 
dt 2m i 
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Determine the form of p(r, t). From this form for p, give an argument for why the 
Klein-Gordon equation is not a good candidate for a one-particle relativistic wave 
equation in place of the Schrodinger equation, for which p = 

14.4. The resolution of the problem outlined in Problem 14.3 is to treat the solution 
to the Klein-Gordon equation as a quantum field. 

(a) Verify that if we write 


V( r, t) = c. 


p(k-r—cut) 




2co \ k VV + “ k VV 


then <p is a solution to the Klein-Gordon equation 

2 * 1 d 2< P (mc\ 2 „ 

* = 0 

provided a) = y k" + (mc/h)- c. 

(b) One can show' that the Hamiltonian for this system is given by 

J N/*((;f) +T *** + (?) 


Show that if r<3 k . a k d = 0, f<5 k , u k ,J = 0, and [« k , « k ,] = 6 k k /, then the Hamil¬ 
tonian becomes 


fi=Y, hoj ( a k«k+ 


Argue that the field <p creates and annihilates (spin-0) particles of mass in and 
energy E = ypV 2 + m 2 c 4 and that these particles are indeed bosons; that is. 
it is possible to put more than a single particle in a state with momentum p. 

14.5. In order to see why the particles created by a scalar field must be bosons, 
consider an alternative procedure for quantizing this field. Try writing 

*’<■■■ *> = £ cJ—[ K —rfr + —AT— 


where the annihilation operators l\ and the creation operators b k obey the anticom¬ 
mutation relations 

(Mk'^Vk' {VM=o {%,%,}=o 

where the anticommutator is defined by 


[A. B} = AB + BA 
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(a) Show that these anticommutation relations require that there cannot be more 
than one particle in a state: 


W°> = o 


and thus the particles are fermions. 

(b) How would the result of your calculation for the Hamiltonian of the preced¬ 
ing problem for the scalar field be modified by using these anticommutation 
relations? It can be shown formally that there is no viable scalar field theory 
(the field essentially vanishes) when the field is quantized using anticom¬ 
mutation relations. On the other hand, for a spin-j Dirac field, the situation 
is reversed: commutation relations for the annihilation and creation opera¬ 
tors lead to problems—negative energies for free particles and, consequently, 
a lack of stability—while anticommutation relations produce a Hamiltonian 
with positive energies. 18 


14.6. Calculate {n ks ) and A« k t for the coherent state 




^k.s—^ 




14.7. Show that the probability P n of finding n k s photons in the coherent state 


la) = e -l“l 2 / 2 £ ^L\n Ks ) 




is given by the Poisson distribution 




14.8. For the coherent state 




the expectation value of the electric field is 

_ llirtia) 


(a|E|a) 


'V p 


|a|e(k. 5 ) sin(k • r — cot + 5) 


18 See R. F. Streaterand A. S. Wightman. PCT, Spin & Statistics, and All That. W. A. Benjamin. 
New York, 1964. 
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where a = \a\e' s . Show that 

(or|E 2 (or) = - 1 * ° [1 + 4|a| 2 sin 2 (k ■ r — cot + 6)] 

provided the contributions of photon states with k' ^ k, s' ^ 5 are neglected, as was 
the case in the beginning of Example 14.1. Use the result of Example 14.2 to show 
that the fluctuations in electric held strength for the eoherent state are the same as 
for the vacuum state |0 k s ). 

14.9. Calculate (or jB|of) for the coherent state |a). Verify that the expectation values 
for E (see Problem 14.8) and B obey Faraday’s law. 

The next three problems require time-dependent perturbation theory strictly 
within nonrelativistic quantum mechanics. The electromagnetic field in these prob¬ 
lems is to be treated as a classical field. The Hamiltonian in Problem 14.10 and 
Problem 14.11 is altered by the presence of an electric field E by the addition of a 
term 

H\ = -fae ■ E 

where is the electric dipole moment operator. 


14.10. A particle with charge-to-mass ratio ejm in a one-dimensional harmonic 
oscillator with spring constant k is in the ground state. An oscillating uniform electric 
field 


E(r) = Eo cos cot 



is applied parallel to the motion of the oscillator for t seconds. What is the probability 
that the particle is excited to the state \n)7 Evaluate the probability of making a 
transition when co = co 0 , the resonance condition. 


14.11. A hydrogen atom is placed in a time-dependent homogeneous electric field 

E(/) = E 0 e~ r/T 


where E 0 and t are constants. At t = 0 the atom is in its ground state. Calculate the 
probability that it will be in a 2 p state as t oo. 

14.12. A spin-4 particle is immersed in a constant magnetic field B {) in the z direction 
and an oscillating magnetic field By cos cot in the x direction. The spin Hamiltonian 
can be written in the form H = cw 0 S z + co 1 cos cut S x . See (4.34). Assume oq <<C <y 0 
and treat the time-dependent part as a perturbation. Calculate the probability that the 
particle is in the spin-down state in time t if it is in the spin-up state at t = 0. Evaluate 
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your result in the resonance region when co is near o> 0 . Compare your perturbative 
result with Rabi's formula (4.44). Suggestion: You may wish to review the material in 
Section 4.4. where an approximation made in deriving Rabi’s formula is described. 

14.13. 

(a) Show for the one-dimensional harmonic oscillator that the position and mo¬ 
mentum operators in the Heisenberg picture are given by 

" / \ - /r\\ Pxfj(O) 

x H (t) = Xf/(0) cos cot H- 2 — sin cot 

mco 

Px H (t) = p x ( 0) cos cot — mcox H ( 0) sin cot 

respectively, where x H (0)=x is the usual position operator in the 
Schrodinger picture and p XH (0) — p x is the usual momentum operator in 
the Schrodinger picture. 

(b) Show that the equal time commutator of the position and momentum operators 
in the Heisenberg picture is given by 

[*w(0, p Xll (t)] = ih 

What happens if times for the two operators are different? 

14.14. As discussed in Section 10.5, the allowed energies of the isotropic three- 
dimensional harmonic oscillator are given by 

E„ = ^ Hco n = 0, 1, 2, ... 

where n = 2n r 4- /. Thus the n = 0 ground state is an / = 0, or.v, state, while the n = 1 
first excited state is an /= 1, or p, state. For a particle with charge cj = e confined 
in this potential, calculate the transition rate R for the transition between the first 
excited state and the ground state with the emission of a photon. 

14.15. An unstable spin-? particle of mass m l initially in the spin state |+z) under¬ 
goes a magnetic dipole transition to a spin-4 particle of mass m 2 , emitting a photon. 
Use the spin Hamiltonian (14.177) to show that the angular distribution of the pho¬ 
tons is isotropic, provided we sum over the probabilities of making transitions to 
both the |+z) and |-z) states. 

14.16. Show that the number of stales, apart from spin, for an electron with energy 
between E and E 4- dE is given by 

V 

-—— p dtt dE 

(2n)W ey 
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14.17. Show that the differential cross section for the photoelectric effect in which an 
electron in the ground state of hydrogen is ejected when the atom absorbs a photon 
is given by 

da_ = H k f {e kj) 2 1 

d£2 m e co [(1/ao) + 

where q — k, — kj . Assume the ejected electron is sufliciently energetic that the 
wave function for the electron can be taken to be the plane wave 

e ik r r 

~7v 

where p f = hk f is the momentum of the electron and fik, is the momentum of the 
incident photon. Suggestion: The cross section is the transition rate divided by the 
incident photon flux, which is equal to c/V in the box normalization used in this 
chapter. 

14.18. 

(a) Show that the magnetic dipole Hamiltonian (14.175) yields a zero matrix 
element for the 3 d to l.v transition in hydrogen. Suggestion: Express the 
angular momentum operator L in terms of L + , L_, and L z . 

(b) Show that the electric quadrupole Hamiltonian can yield a nonzero matrix 
element for this 3 d to Is transition. Suggestion: Use the “trick” (14.152) to 
express the first term on the right-hand side of (14.172) solely in terms ol 
position operators. 

14.19. Use dimensional analysis to argue that the Casimir force per unit area between 
two uncharged conducting plates varies as d~ A , where d is the separation of the 
plates. How does the gravitational attraction of the plates vary with the separation 
of the plates? 
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Electromagnetic Units 




Let's start with Maxwell's equations in SI units: 


V • E= — 

£ 0 

V ■ B = 0 

_ „ 9B 

V x E =- 

dr 

„ n 9E 

V x B = |/ 0 j + no£ 0 — 

at 

In these units, the force on a charged particle q is given by 

F = qE + q\ x B 

We can use Gauss’s law (A. I) most easily in its integral form 




E • dS = 


^enclosed 

«0 


(A.l) 

(A.2) 

(A.3) 

(A.4) 

(A.5) 

(A.6) 


to determine the magnitude of the electric field from a point charge q. Using a 
spherical Gaussian surface of radius r centered on the charge, we find 


or the familiar 


E4nr 2 = — 
t’o 


£ = 


4: re 0 r : 


(A.7) 

(A.8) 
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Thus the magnitude of the force F between two charges q\ and q 2 separated by a 
distance r is 


F= -m- 

4rre 0 r- 

■vhich we can express in the form 

F = k,if 

r i 


(A.9) 


(A.10) 


with the constant k j = 1 /4;re 0 . In SI units, the unit of charge, the coulomb, is defined 
to be equal to 1 ampere-second, while the unit of current, the ampere, is actual!} 
determined by magnetic measurements. This is a natural set of “experimentalist’s" 
units because it is possible to make very accurate current measurements and hence 
fix the unit of current, and thus the unit of charge, quite precisely. Once we know 
the unit of charge, the force between two charges is determined. The value of the 
constant ky is found experimentally to be roughly 9 x 10 9 newton-meter 2 /coulomb 2 . 
See (A. 19). 

Another system of units that is commonly used for Maxwell’s equations is 
Gaussian units. These units can be described as “theorist’s” units, since they are 
somewhat impractical for use in the laboratory but much more useful than SI units 
for revealing the true structure and beauty of electricity and magnetism. In addition, 
they are more commonly used than SI units for describing microscopic phenomena. 
In these units we begin by first determining the unit of charge, not the unit of current. 
In Gaussian units we define the constant k\ to be unity. Then the force F between 
two charges is simply given by 



where the unit of charge is determined by the requirement that two units of charge 
separated by a distance of 1 centimeter exert a force on each other of 1 dyne. This 
unit of charge is called a statcoulomb. The corresponding electric field produced by 
charge q is just 

E = 1 (A. 12) 

r 1 

and consequently [compare (A.8) and (A. 1 )J 


V ■ E = 4np 


(A.13) 


in these units. 

Let’s see what happens to the rest of Maxwell's equations. In SI units. Ampere's 
law. (A.4), can be expressed in integral form as 


B • dr = p 0 J j dS + Mo £ o^; [ E • dS 


(A. 14» 
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where 


J j ‘ dS — ^enclosed (A. 15) 

is the current enclosed by the closed line contour on the left-hand side of (A. 14). 
For a collection of charges that form a current, the magnetic portion of the force 
(A.5) due to a differential length dl of current is given by d F = Id\ x B. Since the 
magnitude of the magnetic field produced by a long current-carrying wire is given by 


B = ^ (A. 16) 

2nr 

in SI units, the magnitude of the force per unit length between two parallel wires 
separated by a distance r carrying currents /j and I 2 is just 


L = (b\ VA 

I \4n) r 

We can also express this force per unit length in the form 


(A.17) 


(A. 18) 


In SI units it is the constant k 2 that is defined to be /x 0 /4 tt = 10 -7 newton/ampere 2 . 
This then determines the unit of current by the requirement that two long wires each 
carrying a current of 1 ampere separated by a distance of 1 meter exert a force per 
unit length on each other of 2 x 10 -7 newton/meter. Note that 


k\ _ 1 / 4n \ _ 1 

k 2 4tt£o \Mo/ £ oMo 


(A.19) 


where c is the speed of light. Thus measuring the speed of light determines experi¬ 
mentally the value of k { in SI units. In Gaussian units, on the other hand, where k y 
is defined, it determines the value of k 2 . 

We still have a little freedom in our choice of units left to play with. In particular, 
even though the force between the two current-carrying wires is determined, the 
magnetic field produced by the wires can still be adjusted. We introduce a constant 
a so that the magnetic portion of the Lorentz force is given by F = k~ ] q\ x B. or 
d F = a~ [ fell x B. Since it is the forces that we measure directly, not the magnetic 
fields, we can avoid changing any physics by adjusting the value of the magnetic 
field produced by the wire so that 


B =k 


Mo l _ 
2 nr 


(A.20) 
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More generally, wc need to modify the right-hand side of Ampere’s law, which we 
now express as 


V x B = 



4nk 2 j 


_1_3E\ 

c 2 dt ) 


(A.21) 


In Gaussian units wc use this freedom to choose k = c. This choice shows the inher¬ 
ently relativistic nature of magnetic effects [compare the magnetic force q (v/c) x B 
with the electric force c/Ej and brings the relativistic nature of electricity and mag¬ 
netism to the fore with the explicit appearance of the speed of light. It also means that 
in Gaussian units the electric field E and magnetic field B have the same dimensions. 1 
Thus in Gaussian units (A.21) becomes 


vr ,> 4 jt. 1 3E 

V x B = — j -|-— 

C c dt 


(A.22) 


Of course, E and B having the same dimensions means that Faraday’s law, (A.3), 
must also be adjusted in Gaussian units. We introduce a third constant k 3 : 


V x E = 



(A.23) 


Since the gradient on the left-hand side of (A.23) has the dimensions of 1/length, k } 
must have the same dimensions as 1/c. In fact, in order for electromagnetic fields 
to propagate at the speed of light, the constant k 3 must equal 1/c. Thus the full 
Maxwell’s equations in Gaussian units are given by 


V • E = 47rp 

V • B = 0 


V 


xE = — 


]dB 

c dt 


V 


„ 4tt . 13E 

xB = —jH-— 

c c dt 


(A.24) 
(A.25) 

(A.26) 

(A.27) 


with the force law 


F = qE + q(\/c) x B 


(A.28) 


Trading e 0 and /i n for the explicit appearance of c seems to be a step in the right 
direction. In fact, these units make more advanced fully covariant presentations of 


1 Since the torque on a magnetic moment n is given by |ixH. scaling up the magnetic field 
by a factor of c means that the magnetic moment of a current loop picks up a factor of 1/c in 
comparison with its value in SI units, as indicated in (1.1). 
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Maxwell's equations in terms of relativistic four-vectors and tensors both straight¬ 
forward and elegant. 2 

For completeness, we should mention one other system of units that is also 
attractive to theorists. If you are interested in making the full Maxwell’s equations as 
simple and as elegant as possible, you can eliminate the factors of 4 n that appear in 
the charge and current source terms of Gauss’s law and Ampere’s law at the expense 
of a slightly more complicated expression for the fields produced by these charges 
and currents. In this other system of units, known as Heaviside-Lorentz units, we 
take k\ = 1/4tt. The corresponding electric field produced by a point charge q is 
then E = q/4nr : . The explicit appearance of these factors of 4nr in the forms for the 
fields, however, is compensated for by their disappearance in Maxwell’s equations. 
From (A.19) we see that k 2 = 1/4jtc 2 . and therefore in Heaviside-Lorentz units 
Maxwell’s equations are given by 


V • E = p 

(A.29) 

O 

ec 

II 

o 

(A.30) 

__ _ j 19E 

VxB--(- 

c c dt 

(A.31) 

V IT l9B 

V x E =- 

c dr 

(A.32) 


with the force law still given by (A.28). Heaviside-Lorentz units are sometimes 
referred to as rationalized Gaussian units. 


2 A good trick (see Sections 10.2 and 11.5) for evaluating expressions such as energies of the 
hydrogen atom is to replace <r with (e 2 flic) lie since the quantity e 2 /hc = a {e 2 /Anemic in SI 
anits) is a dimensionless quantity' whose value is roughly 1/137. In this way you may never need 
I, - recall that the charge on an electron is 4.8 x 10“ 10 statcoulombs hi Gaussian units rather than 
the more familiar 1.6 x 10“ 19 coulombs in SI units. 
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APPENDIX B 


The Addition of Angular Momenta 


■t 


In this appendix we would like to investigate a simple way of adding the angular 
momenta j\ and j 2 together. Our goal is to determine the linear combinations of 
the basis states for two angular momenta that form eigenstates of total angular 
momentum. Rather than give a general proof that the values for the total angular 
momentum j run from |y t — j 2 1 to j l + j 2 in integral steps, we will take the specific 
example of adding spin 4 and spin 1 to illustrate a procedure that can be readily 
extended to the addition of any two angular momenta. 

We label in the usual way the two basis states for a spin-4 particle by 14. 4 and 
14, — 4) and the three basic states for a spin-1 particle by I. 1). 1.0 , and I. -1). 
By taking the direct-product of these basis states, we can form six two-particle basis 
states: 


lffl®U. 1>2 lil)l®|l,0) 2 |f 5>,®ll.-l>2 

if —ft ® Ik 1>2 If— fi®|l,0> 2 If -f,® II.-1)2 (B.l) 

We have arbitrarily chosen to call particle 1 the spin-4 panicle and particle 2 the 
spin-1 particle. As usual, we can drop the direct-product sy mbol ® w ithout gener¬ 
ating any confusion. 

The two basis states | f filU 1>2 and 1 4- -fill- - I'; are sometimes referred to 
as “stretched" configurations. What is special about these configurations is that the 
z component of the total angular momentum takes on its maximum and minimum 
values for these two states, respectively. For example, applying 

J z = J\ z 4* J 2 z (B.2) 


to these kets, we find 


545 
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(4 + 4)lj. 3>l|l. l>2=4li. l>2 + 4li l)llU 1)2 

= Ifi|I,i) 1 |l, 1>2 + *||, 1)2 

= M, $>iU.l>2 (B.3) 

Similarly, 

(4 + 4)15’ —|>lll» -1)2 = -|^l4 — 5>il!’ -1)2 (B.4) 

Clearly, none of the other four states in our set of basis states (B.1) has these 
eigenvalues for J z . 

Since the total angular momentum operator 

■* 

J 2 = (J, H- J 2 ) 2 = JT + J| + 2Ji - J 2 (B.5) 

commutes with the operator J z , these two operators must have eigenstates in com¬ 
mon. Consequently, the states \j, 1)2 and \j, —|)j|l, —1) 2 must each be an 

eigenstate of the total angular momentum J 2 . We label these states in general by 
| j, m). Since m = | and m = —| for these two states, respectively, you may be 
tempted to guess that these are both j = | states. To verify that this is the case, we 
express 2Jj • J 2 in terms of angular momentum raising and lowering operators, just 
as we didin (5.10): 


2J] ■ J 2 — J\ + Ji- + J\-Ji+ + 244 (B.6) 

Then we can apply the operator (B.5) to these states. For example, 

J 2 | 2 , Ihll, l>2 = (J? + J2 + 2 ji-j 2 )|j, ±),| 1 , 1)2 

= [l (l 4 - l) fi 2 + 1(1 + m- + J\+Jo- + J\-Ji+ + 244 j l|’ 5)1!!’ 1)2 

= [5 (5 + i) +1(1 + 1) + 22(1)] n 2 12, 2),|1, 1) 2 

- |> x |l, 1> 2 = |(| + l) A>,|1, 1> 2 (B.7) 

where we have taken advantage of the fact that the raising operator for each of the 
particles yields zero when it acts on a state that has the z component of the angular 
rnomenmm for that particle equal to its maximum value: 

4l5.5)i = 0 and / 2 +|l, 1)2 = 0 (B.8) 

Thus 

lf.|> = l5’5>ill. 1>2 (B-9) 
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Similarly, you can verify that 

-i> 2 = \ (| +1) n% -i) 2 (B.io) 

or 

If’ -f> = li’ -5>,H» - X >2 (B.ll) 

So far we have found two of the four j = 7 states. We can determine the other 
two by applying the lowering and raising operators for the total angular momentum 
to the ||, j) and ||, — 7) states, respectively. Using (3.60), we find 

J- If, |) = y§ ({+ l) - f (f - l) a If. 5 ) = \) (B.12) 

Since 


J- = + J 2 - 


(B.13) 


we also know that 

L\\, 1> 2 = •/,_!?, 5>il 1 . 1)2+4-Ij. i)iI1. 1)2 

= yji (l + l)- h \v 1>2 

+ y/l(\ + 1) - 1(1 - t) ft|f, i>i|l,0) 2 (B.14) 

Equating (B.12) and (B.14), we find that 

1>2 + V^2’ 2>ll ] * 0>2 (B.15) 

Either by applying the lowering operator again to (B.15) or applying the raising 
operator to (B.ll), we can show that 

If’ “I> = M ° >2 + V^lf’ ihl 1 - -!>2 <B.16) 

Thus we have determined all four of the 7 = 5 slates. 

Since our basis (B.l) is six dimensional, there are two states left over. These 
states turn out to be total angular momentum j = \ states. We can generate them by 
taking advantage of the fact that 5 1 f, \) = 0 , that is, the amplitude to find a state 
with j = m — 4 in the state j = 7 , m — 4 is zero. This is enough information to 
deduce that 
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r to an overall phase. Note that the two basis states | j, —4> il •- l^andlj, 4 > 1 11 • 0 : 
arc candidates to be involved in this superposition, since they are the only two oi 
the six basis states with the eigenvalue Cor the z component of the total angular 
•.omentum equal to Lastly, we can determine the total angular momentum state 
. either by applying the lowering operator J_ = + J 2 _ to the state (B.l ~) 

- by choosing the linear combination of the basis states with total m = — 5 that is 
rthogonal to the state (B.16). In this way we find 

lit -s> = Vslf -5>iU. 0>2 - M l^ 1 - “Da <B.18) 

(it course, we haven’t really proved that the two states (B.l7) and (B.l 8 ) are j = 
Mates, although this is consistent with the fact that there are two states remaining 
after we constructed the j = 5 slates. Real proof comes from applying J- to one of 
them. 

In conclusion, note that our initial two-particle basis consisted of six slates and 
that we have now determined the linear combinations of these states that are eigen¬ 
states of total angular momentum with j — 4 (four states) and j = j (two states). The 
procedure that we have used to determine these states can be utilized to add together 
any two angular momenta. For example, i f the system consists of two spin-1 particles, 
the two-particle basis is nine dimensional. The total angular momentum takes on the 
values 2, 1. and 0. We can start with the stretched configuration |2, 2) = 11, 1), 11, 1 : 
and apply the lowering operator (B. 13) to determine the other four j = 2 states. We 
then use orthogonality relation ( 1 . I| 2 , I) = 0 to determine the | 1 . I) state and ap¬ 
ply the lowering operator to determine the other two j — 1 states. Finally, we can 
take advantage of the orthogonality relations ( 0 . 0 | 2 , 0 ) = 0 and ( 0 , 0 | 1 , 0 ) — 0 to 
determine the single j = 0 state. We need two orthogonality relations in this case 
because there are three two-particle states, including 11 , 1 )]| 1 , - 1 ) 2 , 11 . — l)i|l. 1 > 
and 11 . 0 >xl 1 - O):- that can comprise the | 2 , 0 ), 11 . 0 ), and | 0 , 0 ) states. 

The amplitudes (j, ?n|(|./[, 1 ) 1 17*2- M 1 2 > 2 ) arc known as Clebsch-Gordan co¬ 

efficients. Although we have called the individual angular momenta spins in this 
appendix, these angular momenta could be orbital as well as spin angular momen¬ 
tum. Thus (R. 17), for example, could result from the determination of the total 
angular momentum states of a spin -5 particle that has orbital angular momentum 
/ = 1. Compare the specific results of this appendix with the more general results 
of adding spin 5 and orbital angular momentum / in Section 11.5. These Clebsch- 
Gordan coefficients are routinely tabulated, so you don't actually need to calculate 
them each time you need them, once you understand how they are obtained. 
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■i 


The Dirac delta function is actually not a function at all but a "generalized function.” 
or a "distribution” that is defined through the relation 

[ dx f(x)S(x - * 0 ) = f(x 0 ) (C.l) 

J —oc 

for any smooth function f(x). From (C.l) we conclude that 

<5 (Jc — ^o) = 0 x^x 0 (C.2) 


and by setting f (x) = 1 in (C.l) that 



dx <5(.r - A‘ 0 ) = 1 


(C.3) 


Thus the delta function is a "function” that vanishes everywhere except at a single 
point but nonetheless has unit area. You can think of a delta function as the li mit 
of a sharply peaked function of unit area (see Fig. C.l), as it becomes progressively 
narrower and higher. In this limit, the function /(.v) in the integral in (C. 1) can be set 
equal to its value at .v (J since this is the only region in which the integrand is nonzero, 
and then the constant /(.Vo) can be pulled outside the integral. 

We can derive a number of properties of delta functions, with the understanding 
that identities involving delta functions make sense only when the delta functions 
appear within an integral. To derive 


first consider the result 


8(ax) - — (5(a) 

l«l 


(C.4) 



dx f(x)8(ax ) = — 
a 




S(y) = -/(0) 
a 


a>0 


(C.5) 
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Figure C.1 A sharply peaked function with unit 
area. The Dirac delta function arises in the limit 
that the function becomes infinitesimally narrow' and 
infinitely high. 


where in the second step we have made the change of variables y = ax. Note that if 
a < 0 . this same change of variables would switch the limits of integration, leading to 



dx f(x)5(ax) — 

a 





a < 0 


(C. 6 ) 


—a 


These results can be combined together in the form of (C.4). Note that one of the 
corollaries of (C.4) is 


8(-x) = S(x) (C.7) 

The delta function is an even function. Another relation that follows from (C.4) is 

8{x 2 — a 2 ) — —[5(a — a) + <$(a + a)] (C. 8 ) 

2\a\ 

Since x 2 — a 2 vanishes at both x = a and x — —a, we can write 

<S(a' 2 — a 2 ) = 5[(x — a)(x + a)] 

— <5[2 a(x — a)] + <5[—2 a(x + a)J 

= -U«(x-tf) + «0c + a)I (C.9) 

2\a\ 

More generally, suppose /( x) is a function that has a zero at a 0 , that is, /(x 0 ) = 0. 
Expanding /(a) in a Taylor series about a 0 : 

C* ~ -fo) + - ' ' — f tO — ^o) + '' ’ (CTO) 

a=.V() \ / .r=xo 


f(x) = f (a 0 ) + ( C -J- 
dx 
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and taking advantage of (C.4) again, we obtain 


«[/(*)] = 


1 

\df/dx\ xmX0 


8(x ~.x 0 ) 


(C.ll) 


where we can safely ignore the higher order terms in the Taylor series because the 
delta function vanishes everywhere except at x — x 0 . 

When the delta function is multiplied by a smooth function within an integral, 
we can give meaning to the derivative of a delta function: 


f°° dx /(*)i-6(x) = f(x)S(x)\°° oo - r dx^y^-Hx) 

J-oo dx 00 J- oo dx 

dm 

dx 

where the second step follows from an integration by parts. Also the integral of a 
delta function satisfies 



(C.12) 



0 x < a 
1 x > a 


= 0(x — a) 


(C.13) 


where 0(x — a) is the standard step function. From this result, we also see that 

—0(x — a) = S(x — a) (C.14) 

dx 

A convenient way to represent a delta function is as the limit of a sequence of 
regular functions that have unit area but that grow progressively more narrow as 
some parameter is varied. Some examples: 


1. The function sin Xx/nx is plotted in Fig. C.2. This function is well behaved 
for any finite value of X. The width of the function is of order l/X, since the 
first zero of the sine function occurs when Xx — n. Moreover, the height of the 
function at the origin is X/n. Thus as X increases, the function grows narrower 
and taller. In fact, the normalization factor of 1 /jt has been chosen so that the 
function has unit area. Therefore, as X —► oo. the function behaves as a delta 
function: 


<5(x) = lim 


1 sin Xx 


\-foo n x 


(C.I5) 


2. An alternative way of expressing the representation (C. 15) of the delta function 
is especially useful. Since 


sin ax 
x 


2 

2 



dk e ikx 


(C.I6) 
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we can write 


i r 

S(x) = — / dke 

2tt J—oq 


ikx 


(C.17) 


3. Another representation of the delta function is given by 


5(.x) = lint J—e~ 

a—oc V 7T 


(C.18) 


as can be verified by using the results of Appendix D on Gaussian integrals. 
4. Finally, you can show that 


8 (x) = lim — - £ 


7T x 2 + e 2 


(C.19) 
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Gaussian Integrals 




We first wish to evaluate the integral 

r°° 2 

1(a) = / dxe~ ax (D.l) 

J —OC 

where Re a > 0. A useful trick is to consider the integral squared, 

/ oo . poo poo poo , 

dxe~ ax - / dy e~ ay = dx dy e ~ a(x +y ) (D.2) 
•OO J—00 J—OC J—00 

which can be easily evaluated by switching from Cartesian to polar coordinates: 

-r,*. r 

Jo Jo 


I 2 (a) ■ 


d0 e~ ar =- 


(D.3) 


Thus 


What about integrals such as 

/ OO , 

dx e -ax-+bx 

-oo 


(D.4) 


(D.5) 


Here we can convert the integral into one in which we can take advantage of (D.4) 
by completing the square in the exponent: 


2 a ( h V hl 

ax — bx = a \ x -- 

\ 2a J 4a 

Making the change of variables x' = x — ( b/2a ), we find 


(D.6) 


/(a, b) = r dx e~ ax2+bx = e b2/4u dx’ e~ ax ' 2 - e b2/4a j- (D.l) 
J—oo J—oc V O. 
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We can also evaluate integrals of the form 


rw -£ 


dx e~ ax x 2 


(D.8) 


by differentiating (D. 1) under the integral sign: 


*■>--££ 


dx e 


d fn _ 1 fW 

da\a 2 V n 3 


(D.9) 


This technique can be easily extended. For example, 


f 00 2 , 

/ dx e ax x 4 = —jl(a) 
J -00 


<P_ 

da 2 


(D.10) 


Finally, we should note that although we have derived (D.4) for Re a > 0, we can 
extend this result to include a in (D.l) being purely imaginary. This is most easily 
done through contour integration. First consider the closed integral 




dz e~ az 


2 


(D.ll) 


in the complex plane for the contour shown in Fig. D.l, with a real and positive. Since 
the integrand is analytic within the contour, the closed contour integral vanishes: 


f 


dz e~ az 


2 


= 0 


Writing 


z = re' 8 — r (cos 6 + i sin 9) 


(D.12) 


(D.l 3) 



Figure D.l A closed contour in the complex z = x + iy 
plane. The contributions on the circular arcs vanish as 
R -»• oo. 
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- c >ee on the circular arcs of radius R that the integrand is given by 

—ai? 2 (cos 20+i sin 28) 


(D.14) 


a hich goes exponentially to zero as R —► oo for 0 < 6 < n/A and tt < 9 < 5n/4 
Mnce cos 26 > 0 for these angles. Thus the contribution of the circular arcs to the 
ci ntour integral vanishes as R —> oo. We can parametrize the diagonal line by z = 
r • ~ 4 . with r running from oo to —oo. Since z 2 = r 2 e' n I 2 = ir 2 and clz — dr e'*/ 4 , 
we obtain 

<f dz dx e~ ax2 + e in,A f ~ dr e~ iar2 = 0 (D. 15) 

J J—oo J OO 

(D.16) 

V ia 

take J~i = e l7r>!4 . 


Consequently 



a hich is just the same as (D.4) with a replaced by ia, provided we 
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APPENDIX E 


The Lagrangian for a Charge q 
in a Magnetic Field 


■» 


How do we handle magnetic fields within the framework of a Lagrangian? Purely 
electric forces are easy. After all, the electric potential ip(r) is introduced in electro¬ 
statics as the work done per unit charge to bring the charge to the position r from 
some reference point, which is often taken as at infinity. Then the potential energy 
of a charge q is V = qtp and the Lagrangian is given by 

L = T - V = 5 /nv 2 — qq> (E.l) 


In terms of the Cartesian coordinates x { = x, = y, and *3 — z. the Euler-Lagrange 
equation of motion 


9L 

dXj 


dt \ dXj) 


for the Lagrangian (E. 1) is given by 




0 


This equation of motion is simply 


mxj 


d(p 



(E.2) 


(E.3) 


0E.4) 


Since the electric field E is given in electrostatics by E = — V<p, the equation of 
motion can be expressed in terms of vectors as the force law m a = F = q E. 

The full Lorentz force 


F = gE + q(\/c) x B 


(E.5) 
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includes velocity-dependent magnetic forces, which cannot be obtained from a 
. .. c:\mgian of the form (E.l) that is just the difference of the kinetic and potential 
energies. Since the magnetic force always acts at right angles to the velocity, it 
doesn't change the magnitude of the velocity and thus does no work. However, we 
can show that the Lagrangian 


. I 2 
L = -my 
2 


qtp + —A • v 
c 


(E.6) 


w hich differs from (E. 1) by the addition of a velocity-dependent term involving the 
. ector potential A, yields the Lorentz force (E.5) for the equations of motion. 

First, we note that the magnetic field B can always be expressed in the form 
B = V x A, since the magnetic field satisfies V • B = 0 and the gradient of a curl 
vanishes: 


V ■ B = V ■ (V x A) = 0 


Since for the Lagrangian (E.6) 


3 L . q 
— = mx. H— A: 
dx, c 


the canonical momentum p, — dL/dXj is given in vector form by 


In order to evaluate 


p — m\ -(- -A 
c 


±( d S\= m x l + i d -± 

1 c dt 


dt \dXiJ 

notice that A t = y(t), z{l), t J and therefore 

= *Al + v d ±L = 'tA + v . v a . 

dt dt “f dx: dt dt 
j =l 1 


Using 


3 L d<p q 3A 

— = -q~ + -V- 

dx/ dXj C dx. 


(E.2) becomes 


or 


dtp v 3A q (dAj \ 

‘) = 

dtp v 3A q ( 3 A.- „ a \ 

mxi= - c, i +q -c-^-c{-w +yVA 0 


(E.7) 


(E.8) 


(E.9) 


(E.10) 


(E.l I) 


(E.l 2) 


(E.l 3) 


(E. 14) 
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In vector notation, (E. 14) can be expressed in terms of the force F on the particle as 


or 


F = * 



1 9A\ 
c dr ) 


+ — [V(v • A) — (v • V)AJ 
c 


(E.15) 


F = qE + x (V x A) =qE + -v x B 


c 


c 


(E.16) 


as desired. 

Given the Lagrangian (E.6), we can determine the Hamiltonian in the usual way: 


h = Pi*i ~ L 

i=l 


i=i ^ 



1 . . q 

-mXjXj -q<p + - 

2 c 




(E.17) 


At first it appears that the vector potential has disappeared entirely from the Hamilto¬ 
nian. However, if we express the Hamiltonian in terms of the canonical momentum 
(E.9), we obtain 


,, (p-tyA/c ) 2 

H =--- +q<p 

2m 


(E.18) 


This suggests a mnemonic for the way to mm on electromagnetic interactions in 
terms of the Hamiltonian: take the energy for a free particle of charge q 





2m 


(E.19) 


and make the replacements p -» p — c/A/c and E —>■ E — qtp to generate (E.18), 
with the energy E replaced by the symbol for the Hamiltonian. 
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APPENDIX F 


Values of Physical Constants 


1 A = 0.1 nm = 10 

“10 

m 

1 eV = 1.602176487(40) x 10~ 19 J 


1 fm = 10 -15 m 


= 1.602176487(40) x 10~ 12 erg 


1 bam = 10~ 2S m 2 


1 MeV = 10° eV 



1 dyne = 10 5 newton (N) 

1 eV/c 2 = 1.782661758(44) x 10" 36 kg 


1 gauss (G) s 10 4 

tesla (T) 

2.99792458 x 10 9 esu = 

1 coulomb (C) 


1 erg = 10“' joule (J) 

0° C s 273.15 K 



Quantity 

Symbol 

Value 

Gaussian 

SI 

Speed of light 

c 

2.99792458 

10 K) cm/s 

10 8 m/s 

Planck’s constant 

h 

6.62606896(33) 

10 -27 erg s 

10 -34 J s 


h = h/In 

1.054571628(53) 

10- 27 erg s 

IO" 34 J s 



6.58211899(16) 

lO" 16 eV s 

IO -16 eV s 

Electron charge 

e 

1.602176487(40) 

— 

O 

1 

tc 

n 



4.80320427(12) 

10~ 10 esu 

— 

Electron mass 

m e 

9.10938215(45) 

lO" 28 g 

10' 31 kg 



0.510998910(13) 

MeV/c 2 

MeV/c 2 

Proton mass 

m p 

1.672621637(83) 

io- 24 g 

10- 27 kg 



938.272013(23) 

MeV/c 2 

MeV/c 2 

Neutron mass 

m„ 

939.565346(23) 

McV/c 2 

MeV/c 2 

Reciprocal 

1/or 

137.035999679(94) 



fine-structure 





constant 





Bohr radius 

"0 

0.52917720859(36) 

IO" 8 cm 

10“ 10 m 

(h/m e ca) 





Bohr magneton 

Mb 

5.7883817555(79) 

10~ 9 eV/G 

10 -5 eV/T 

(eh/2m e c) 





Boltzmann 


1.3806504(24) 

IO" 16 erg/K 

IO" 23 J/K 

constant 





Avogadro 


6.02214179(30) x 10 23 /mol 


constant 





Values from J. Phys. 

G: Nuc. Pan. Phys. 37. 075021 (2010). 




Page 577 (metric system) 


561 





APPENDIX G 


Answers to Selected Problems 


I. 1 1.2 x 10 3 G/cm 
2.21 0.12 

2.22 \f2>Nh/2 

4.12 sin 4 (oj 0 r/2) 

6.14 A.r Ap x = 0.57/1 

6.17 (a) V30 /L 5 (b) 960/tt 6 (c) 5h 2 /mL 2 
6.19 -h 2 k 2 /&nb 2 

7.13 0.16 
9.9 1.13 A 

10.4 0.24 
10.7 0.70 

10.11 (a) (2A0) 2 ti 2 /2iia 2 , ( 5.52) 2 H 2 /2na 2 , (S.65) 2 H 2 /2^a 2 
(b) (2A0) 2 H 2 /2na 2 , (3.83) 2 /j 2 /2/xa 2 . (5.14) 2 ^ 2 /2yu« 2 

II. 7 (Z>) Eg 1 = (2/5)(e 2 /a 0 )(R/a 0 ) 2 , Eg = (1/1120)(^/a 0 )(^/ao) 4 

12.5 £ < —h 2 X 2 /4nmb 2 


563 
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Index 


Accidental degeneracy, 360, 368, 375 
Action. 289 

Active transformation, 53, 197 
Adjoint matrix, 50 
Adjoint operator. 35. 66 
Aharonov, Y.. 487 
Aharonov-Bohm effect. 483—187 
Alvarez, L., 447 
Ammonia molecule. 128-134 

energy-time uncertainty relation and. 134 
maser, 134 

orbitals and shape, 336 
perturbation theory and, 395-398 
in static electric field, 131-133, 395-398 
in time-dependent electric field, 133-134 
Angular momentum, 75-106 
addition of, 545-548 
spin-orbit, 401—404 
spin-spin, 147-150 
commutation relations, 79. 104 
conservation of, 118.138 
eigenstates and eigenvalues. 82-89 
eigenvalue problem 
spin-1. 100-103 
spin-j, 94-98 

lowering and raising operators, 85-86 
matrix elements of, 90-91 
operators. 36, 65, 79 
orbital eigenfunctions. 331-337 
uncertainty relations, 91-93, 105 
Annihilation operator. 254, 276, 498 
Antibonding orbital, 444 
Anticommutation relations, 499. 535 
Anticommutator. 106, 534 
Antisymmetric states, 422 
Anyons, 421 
Aspect, A.. 164 

Balmer series, 353 


Basis states, 11,23 

for two spin-4 particles, 141-143 
Bell, J„ 156 
Bell's inequality, 161 
Bethe. H.. 408 
Birefringence. 63 
Bohm. D.. 487 
Bohr. N„ 4. 261 
Bohr magneton. 26, 177 
Bohr radius, 355 
Bonding orbital, 444 
Bom. M„ 194 

Bom approximation, 458-462 
validity of, 462-463 
Bom interpretation, 194, 237 
Bom-Oppenheimcr approximation, 442 
Bose-Einstein statistics, 422 
Bosons, 422 
Bra(c)ket. 12, 22 
Bra vector, 12. 22 
Brcit-Wigner formula, 474 

Casimir, H., 529 
Casimir effect, 528-529 
Cavity quantum electrodynamics, 526-529 
Center-of-mass and relative coordinates. 
311-313 

Central potentials 

bound states of. 345-376 
conservation of orbital angular momentum. 
317 

scattering from. 458-477 
Centrifugal barrier, 347 
Classical path, 300 
Clebsch-Gordan coefficients, 548 
Coherent states, 262-269 
for photons, 500 
Commutation relations 

angular momentum. 79, 104 
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Commutation relations (continued) 
energy-time, 134-136 
position-momentum, 306 
Commutator, 79, 104 
Commuting operators, 80-81 
Completeness, 25, 43 
Completeness relation, 43, 68, 193 
Complete set, 23 

Complete set of commuting observables, 
321 

Complex numbers 

and quantum mechanics, 13, 20 
Conservation 

of angular momentum, 317 
of energy, 114, 514-515 
of linear momentum, 200, 309-311 
of probability, 226, 455 
Constant of motion, 114, 200 
Correlations in a spin-singlet state, 152-155 
Correspondence principle, 260, 263, 343 
Cosmic background radiation, 326 
Coulomb gauge, 488 
Coulomb interaction, 143 
Coulomb potential, 313, 348 
Covalent bond, 441 
Creation operator, 254, 276, 498 
Cross section 
differential, 456 
partial wave, 469 
Rutherford, 464 
total, 456 
Curie, P„ 178 
Curie constant, 178, 188 
Curie’s law, 178 

Darwin term, 405-407 
de Broglie relation, 204 
Degeneracy, 81 

accidental, 360, 368, 375 
for harmonic oscillator, 373-375 
for hydrogen atom, 359-360 
Degrees of freedom, 10, 191, 199 
Delta function normalization, 194 
Delta function potential, 242 
Density of states, 516-517 
Density operator, 171-181 
reduced, 179 
Deuterium, 353-354 
Deuteron, 360 

finite spherical well and, 360-364 
fusion and, 447 
Diatomic molecules, 321-326 
Diffraction, 300 


Dipole moment 
electric, 131 
magnetic, 1 
Dirac. P„ 247 

Dirac delta function, 194, 235, 549-552 
representations of, 551—552 
in three dimensions, 304 
Dirac equation, 401,405—407 
Direct product, 142, 183 
Double-slit experiment, 210-212, 281, 
295-297, 485-487 

Effective potential, 320, 348 
Ehrenfest, R, 3 
Ehrcnfest’s theorem, 200 
Eigenbra, 255, 265 
Eigenfunction 
energy, 214 
momentum, 203 

orbital angular momentum, 331-337 
parity, 273-274 
Eigcnket, 38, 234, 265 
Eigenstate, 38, 67 
Eigenvalue, 38, 67 
Einstein, A., 154, 165 
Einstein-Podolsky-Rosen paradox, 155 
Electric dipole 

approximation, 520 
moment, 131 
selection rule, 507, 524 
transition, 524 

Electromagnetic units, 539-543 
Electron 
g factor, 2 
Energy 

conservation of, 138, 514-515 
eigenstates and eigenvalues, 113 
operator (Hamiltonian), 113, 137 
uncertainty relation, 134-136 
Entanglement, 166-169, 179-181 
Evolutionary time, 135 
Exchange operator, 420 

eigenstates and eigenvalues of, 420-422 
Exchange term, 431 
Expectation value, 15, 24, 58, 67 
time dependence of, 114 

Fermi-Dirac statistics, 422 
Fermi gas model, 365 
Fermions, 422 
Fermi's Golden Rule, 518 
Feynman, R„ 7, 281,408, 532 
Feynman diagrams, 530-532 
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Fine structure, 408 
Fine-structure constant, 351 
Fine structure of hydrogen. 408 
Fourier transform. 204. 206. 237 
Free particle 

Hamiltonian, 208 
propagator, 284 
Functional integral, 289 

Gauge 

Coulomb, 488 
transformation. 484 
Gaussian integrals. 553-555 
Gaussian units, 540 
Gaussian wave packet, 204-208 

minimum uncertainty state. 207-208 
time evolution, 208-210 
Generator 

of rotations, 36-37. 65 
of time translations. 112, 137 
of translations. 197-200. 236. 304. 338 
Gerlach, W„. 1, 4 
g factor, 2-3 

magnetic resonance and, 124 
spin precession of muon, 119-120 
Goudsmit, S., 2 
Green’s function, 459 

Hamiltonian, 113, 137 
Hard-sphere scattering, 469-471 
Harmonic oscillator, 245-276, 369-375 
coherent states of, 262-269 
eigenfunctions, 254-257 
in external electric field. 386-389 
large-n limit. 259-261 
lowering operator, 249. 274 
number operator. 248 
parity and, 273-274 
raising operator, 249. 275 
three dimensional, 369-375 

in Cartesian coordinates, 370-372 
degeneracy. 373-375 
in spherical coordinates, 372-373 
time dependence. 261-262. 264-269 
zero-point energy. 257-259 
Hartree, D„ 437 
Hartree method, 437 
Hcaviside-Lorentz units, 543 
Heisenberg picture. 508-509 
Heisenberg uncertainty relation. 200, 210, 236 
Helicity, 64 
Helium atom, 424-434 
excited states, 428-432 


ground state, 424-428 
variational method and, 433-434 
Hermite polynomial, 272 
Hermitian operator, 37,67 
matrix representation of, 50 
Hidden-variable theory, 154 
Hydrogen atom, 348-360. 398-410 
fine structure, 408 
hyperfine structure. 143-147 
lifetime of 2 p state, 521-524 
radiative transitions, 518-526 
uncertainty principle and, 313-314 
Hydrogen maser. 146 
Hydrogen molecule, 447—448 
Hydrogen molecule ion. 442-446 
Hyperfine interaction. 143-147 
Hyperfine structure, 409 

Identical particles, 419—448 
bosons, 422 
fermions. 422 
helium atom and. 424—432 
hydrogen molecule and, 447—448 
Identity operator. 41,46, 68. 235 
Impact parameter. 465 
Inner product, 12. 142 
Interaction picture, 510-513 
Interference, 9, 45. 210, 281. 300 
2k rotations and. 120-122 
gravity and, 297-299 
Intrinsic spin, 2 
Invariance, 303 
gauge, 487 
under inversion. 274 
rotational, 118. 138. 314-317, 320 
translational, 200. 309-311 
Inversion symmetry. 274 
Ionic bonding, 440 
Ionization energy of elements. 438 

Ket vector. 5. 10, 12, 22 
Klein-Gordon equation. 533 
Kronecker delta. 23 

Lagrangian, 288 
Laguerre polynomials, 351 
Lamb, W„ 408 
Lamb shift. 408 
Laplacian 

in cylindrical coordinates, 344 
in spherical coordinates. 330 
Large-n limit 

of harmonic oscillator, 259-261 
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Least action principle, 291-296, 300 
Legendre polynomials, 334, 342 
Legendre’s equation, 342 
Lifetime, 518 
Linear operator, 34 
Linewidth, 136, 146 
Lorent/., H., 3 
Lorentz force 

in Gaussian units, 542 
in SI units, 539 
Lowering operator 

for angular momentum, 86 
for harmonic oscillator. 249 
Lyman scries, 353 

Magic numbers, 368 
Magnetic dipole 
moment, 1-3 
transition, 525 

Magnetic resonance, 124-128 
Maser, 134, 146 
Matrix element, 47, 69 
Matrix mechanics, 29-32, 46-51, 69-70 
Matrix representation 
of bras, 30, 69 
of kets, 30, 69 
of operators, 47, 69 
Maxwell’s equations, 539-543 
in Gaussian units, 542 
in Heaviside-Lorentz units, 543 
in SI units, 539 
McKellar, A., 326 
Measurement 

of position, 191-192 
of spin, 3-5 

Minimum uncertainty state, 207, 258, 267 
Mixed state, 171, 183 
Molecules, 321-326 

covalent bonding, 441 -448 
orbitals and, 336-337 
rotation of, 323-325 
vibration of, 321-323 
vibration-rotation of, 325 
Moment of inertia, 319, 323 
Momentum 

conservation of, 309-31 I 
eigenfunction, 203, 307 
operator, 198 

in position space, 202, 307 
Momentum space, 202-204 
Multielectron atoms. 437^141 
Muon 

catalysis, 446-447 


spin precession of, 119-120 
Muon-catalyzed fusion, 446^147 
Muonic atom, 446 

Neutron interferometer, 121 
In rotations and, 120-122 
gravity and, 297-299 
No-cloning theorem, 166, 169-170 
Normalization 
continuum, 235 
discrete, 23 
Number operator, 248 

Observable, 22 
Operators, 34, 65 
adjoint, 35, 50, 66 * 
eigenstates and eigenvalues of, 38, 67 
Hermitian, 37, 67 
identity, 41,46, 68, 70 
linear, 34 

matrix elements of, 47 
product of, 51 
projection, 42, 68 
unitary, 35, 66 
Optical theorem, 469 
Orbital angular momentum 
eigenfunctions, 331-337 
generators of rotations, 315 
operators, 315 

in position space, 328-330 
Orbitals, 336 
Orthonormal set, 23 
Outer product, 142 
Overlap integral, 444 

Pais, A„ 165 
Parity, 223 

eigenfunctions, 273 
operator, 273 

Partial wave analysis, 465-469 
for finite potential well, 471-473 
for hard-sphere scattering, 469-471 
resonances and, 473—176 
Particle in a box 

one-dimensional, 219-224 
three-dimensional, 365-368 
Paschen-Bach effect, 416 
Paschen series, 353 
Passive transformation, 53,197 
Path integrals, 281-301 

Aharonov-Bohm effect and, 485—487 
for free particle, 289-291 
gravity and, 297-299 
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phasors and, 293-295 
principle of least action and, 291-296 
Path of least action, 292 
Pauli exclusion principle. 423,448 
Pauli spin matrices. 96. 105 
Pckeris, C„ 434 
Penzias, A.. 326 

Periodic boundary conditions, 490 
Periodic tabic, 438 

multielectron atoms and. 437^141 
Perturbation theory 
degenerate. 389-391 
nondegeneratc. 381-386 

harmonic oscillator and. 386-389 
relativistic perturbations in hydrogen and. 
398^107 

time-dependent, 504-506 

harmonic oscillator and, 506-507 
Perturbing Hamiltonian. 381 
Phase 

overall, 18, 22 
relative. 25 
Phase shift, 467 
Phase space, 516 
Phasors, 293 
Photon, 59-64, 495-498 

circularly polarized, 62-64, 498 
energy and momentum. 495—498 
intrinsic spin of, 64. 498 
linearly polarized. 60-62 
Photon-atom scattering, 530-532 
Picture 

Heisenberg, 508-509 
interaction, 510-513 
Schrodinger, 507-508 
Pion 

as a bound state of quarks, 150 
decay of, 119 

scattering with proton, 475-476 
Planck’s constant, 4 
Plane wave 

partial wave expansion, 466 
Podolsky, B., 154 
Poisson distribution, 501,535 
Poisson’s bright spot, 471 
Polarization 
circular, 62 
linear, 60 

Position eigenstate, 191 
Position-momentum uncertainty, 200. 207. 
210,236 

Position-space wave function 
in one dimension, 195 


in three dimensions, 304 
Potential 

harmonic oscillator, 246 
Poynting vector. 496 
Precession 

of muon, 119-120 
of neutron. 120-122 
of spin-4 particle, 115-118 
Probability. 9, 13, 24 

conservation of, 111-112 
Probability amplitude, 9, 22 
Probability current, 226 
Probability density. 454 

in momentum space, 203, 307 
in one dimension,-.-194 
in three dimensions, 304 
Projection operators, 42, 68 
matrix representation of. 48 
Propagator, 283 
free particle. 284 
Pure state. 171, 183 

Quantization of radiation field, 493-499 
Quantum electrodynamics (QED), 408, 
532 

Quantum teleportation. 165-169. 180-181 
Quarks, 2. 150.452 

Rabi’s formula. 126 
Radial equation, 320 

behavior at origin, 345-347 
Radiation 

blackbody. 492 
electric dipole, 520-524 
magnetic dipole, 524-526 
spontaneous emission, 518-528 
stimulated emission, 126,519 
Raising operator 

for angular momentum. 86 
for harmonic oscillator, 249 
Ramsauer-Townsend effect, 477 
Reduced density operator, 179 
Reduced mass. 312 
Reflection coefficient. 228 
for step potential, 229 
Relative and center-of-mass coordinates, 
311-313 
Resonance 

magnetic, 124-128 
scattering. 473—176 
Retherford. R., 408 
Rigid rotator. 323. 343 
Rosen, N„ 154 
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Rotation 

matrices, 54. 62 
of molecules, 323 
operators. 33-40, 65 
Rotations 

by 2,t radians, 40, 122 
generator of, 36, 65 
noncommutativity of, 75 
Rutherford scattering. 463^65, 480 
Rydberg atom, 527 
Rydberg constant, 353 

Scattering, 451—478 
amplitude, 458 

asymptotic wave function. 454 
Bom approximation, 458-463 
differential cross section, 456 
one-dimensional, 224-231 
optical theorem, 469 
partial waves, 465-469 
photon-atom, 530-532 
pion-proton. 475—476 
by potential barrier. 230-231 
resonant, 473^176 
Rutherford, 451,463-465 
by a step potential, 226-230 
total cross section. 456 
Schrodinger equation, 112, 137 
in position space, 214 
in three dimensions, 319, 330 
time-dependent, 214 
time-independent, 214. 232 
Schrodinger picture, 507-508 
Schwarz inequality, 92 
Selection rules, 324, 325. 343,507,524 
Self-adjoint operator, 37,67 
Separation of variables, 371 
SG device, 5 
modified, 7,41 
Shell structure 
of atom, 358. 405 
of nucleus, 368 
Silver atom 

and Stem-Gerlach experiment, 3-5, 26 
Singlet spin state, 150, 182 
Single-valuedness, 329 
Solid angle. 332 

Spectroscopic notation, 359. 404, 425 
Spherical Bessel equation. 366 
Spherical Bessel functions. 366, 466 
Spherical coordinates, 316 
Spherical harmonics, 331-337 
Spherical Neumann functions, 366, 466 


Spherical well 
finite, 360-364 
infinite, 365-368 
Spin, 1-5 

measurement, 5 
precession, 115-118 
Spinor, 405 

Spin-orbit coupling, 364, 368, 400-405 
Spin(s) 

addition of. 147-152 
Spin-spin interaction 

hyperfine splitting of hydrogen, 143-147, 

" 150 

Spin-statistics theorem, 423 
Square-well potential 

in one dimension.*214-219 
scattering, 471-473 
in three dimensions, 360-363 
Standard deviation 
and uncertainty. 16 
Stark effect. 391-395 
Statcoulomb, 540 
State vector 

quantum, 10-14 
Stationary state, 114 
Statistical fluctuations 
and measurement. 17 
Statistics 

Bose-Einstein. 422 
Fermi-Dirac. 422 
Stem. O., 1,5 

Stem-Gerlach experiment, 3-5 
Stimulated emission, 126,519 
Superposition, 13. 22. 25 
Symmetric stales. 422 
Symmetry' 

under exchange, 423 
in quantum mechanics. 339 
rotational, 317 
translational, 309-311 
Symmetry operation, 118, 138 

Teleportation. See Quantum teleportation 
Thomas, L., 401 
Thomas precession, 401 
Time-dependent perturbation theory, 504-506 
Time derivative 

of expectation values, 114 
of operator, 114 

Time evolution operator. Ill, 137 
Transformation 
active, 53, 197 
gauge, 484 
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passive, 53,197 
Transition 

electric dipole, 524 
electric quadrupolc. 526 
magnetic dipole, 525 
rale. 517-518 
Transition amplitude. 283 
Translational invariance, 309-311 
Transmission coefficient, 228 
for square barrier, 231 
for step potential, 229 
Triplet spin state. 150, 182 
Tunneling, 230-234 

ammonia molecule and. 130 
ethylene and. 251 
Two-term recursion relation, 271 

Uhlenbeck. G„ 2, 401 
Uncertainty 

definition of. 16, 24 
Uncertainty' principle 

estimating energies and, 313-314 
Uncertainty relations 
angular momentum. 93 
derivation of, 92-93 
energy-time, 134-136 
Heisenberg, 200, 236 
position-momentum. 200. 210, 236 
Unitary' operator, 35. 66 
Unperturbed Hamiltonian, 381 


Urey. H„ 354 

Vacuum state, 495 
Variational method, 432—433 
for helium, 433-434 
for hydrogen molecule ion. 442-446 
Vector operators. 82. 316 
Vector potential, 483 

Aharonov-Bohm effect and, 484-487 
operator, 494 

Vibration of molecule, 321-323 
Virial theorem. 378 

Wave function 

in momentum space, 203 
in position space. 195 
Wave packet 

Gaussian, 204-207 
scattering, 452 
spreading, 209 
Wcmer, S. A., 120 
Wilson. R„ 326 

Yukawa potential, 463 

Zeeman. P.. 410 
Zeeman effect, 410—412 
Zero-point energy, 257-259 
behavior of helium. 258 
of electromagnetic field, 495 
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